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A method to construct Monte-Carlo event generators at arbitrarily non-leading 
order is explained for the case of a non-gauge theory. A precise and correct treatment 
- - , of parton kinematics is provided. Modifications of the conventional formalism are 

i required: parton showering is not exactly the same as DGLAP evolution, and the 

\ external line prescription for the hard scattering differs from the LSZ prescription. 

^ ' The prospects for extending the results to QCD are discussed. 
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INTRODUCTION 



There are two contrasting approaches to making perturbative predictions from QCD. The 
first consists of the analytic (or matrix element) methods where a factorization theorem 
is used to write inclusive cross sections in terms of perturbatively calculable hard-scattering 
coefficients and of parton densities and fragmentation functions. The second approach is 
that of the Monte-Carlo event generator 0. Now, the analytic methods have the advantage 
of permitting calculations to arbitrary non-leading order, and thus of allowing a system- 
atic improvement in the accuracy of calculations. In contrast, the Monte-Carlo methods 
^ ■ have the advantage of generating complete events and of implementing QCD predictions 
for the detailed structure of the final state, but at the expense of model dependence in 
k>( ; the hadronization. But up to now, a systematic method of incorporating higher order cor- 
;h ' rections, in powers of a^, has not been available for the Monte-Carlo method. Currently 
^ available event generators incorporate hard scattering and evolution at the leading order 

(leading- logarithm approximation, and somewhat beyond), and, although some implemen- 
tations of non-leading corrections have been made p, ^, there has not so far been devised 
a general method. 

It is of course highly desirable to extend the algorithms in the event generators to non- 
leading order. Once this aim has been achieved, one can use the highest precision QCD 
calculations while retaining the advantages of the event generators. The many currently 
available NLO and NNLO analytic calculations of Feynman graphs will of course continue 
to be used; they will merely need to be reformulated to be used in event generators. 

In Ref. 0, I proposed a subtractive approach and applied it to the simple, but phe- 
nomeno logically important, process of photon-gluon-fusion in deep inelastic scattering. This 
approach was intended to be generalized to provide a complete solution to the problem of 
non-leading corrections to event generators. Unfortunately, a full treatment of QCD is quite 
hard, because of the need to treat soft gluon effects. So the present paper restricts its 
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attention to providing a complete treatment within a model theory where only collinear 
configurations are important. 

For ease of exposition, only the rather unphysical case of 0'^ theory in 6 space-time 
dimensions will be discussed, but the methods immediately generalize to any non-gauge 
theory^. In such a theory, we will see in complete generality how to construct Monte-Carlo 
event generators for e^e~ annihilation with the inclusion of arbitrarily non-leading order 
corrections, in the hard scattering and in the final-state showering. The irreducible errors 
are then due to power-law corrections to the hard scattering and evolution kernels. The 
extension to initial-state showering should be relatively elementary. 

The general principle of the method is to generate extra kinds of hard interaction and 
splitting kernels corresponding to non-leading-order corrections, and to apply subtractions to 
allow for the effect of lower-order terms coupled to showering. The subtractions are applied 
point-by-point in momentum space with proper allowance for correct parton kinematics, 
so that the NLO corrections form completely well-behaved functions. This is in contrast 
to the subtractive methods used in parton-level Monte-Carlo calculations for infra-red-safe 
observables, as in Ref. 0. In those methods the NLO corrections are singular distributions 
which can only be used to calculate suitably infra-red-safe observables. The new method is 
applicable to general cross sections. 

The viewpoint in this paper can be characterized as "static": It asks what the value is 
of a certain sum of integrals for the cross section. This contrasts with the more dynamic 
viewpoint normally used in discussions of event generators, where the primary concept is 
the evolution of partonic states. An actual implementation of the methods of this paper will 
recover this dynamic point-of-view, for that is inherent in the structure of the Monte-Carlo 
algorithms derived from the static Feynman graph structure. 

Although the model theory treated in this paper is relatively simple, even this case 
already poses some quite non-trivial difficulties if one is to solve completely and reliably 
the problem of obtaining corrections for use in an event generator. Arbitrarily high-order 
Feynman graphs are being approximated in such a way that an estimate of the cross section 
for producing particles needs computational resources proportional to A^, even though a 
brute force calculation requires resources more like A^!. 

Since the mathematical structure worked out in this paper is quite general, I hope that 
the same structure and proof will apply in many situations, with minor modifications. The 
essential problem to be solved in this and any other problem in asymptotic behavior in 
quantum field theories (including QCD) is to find an economical way of treating the large 
multiplicity of different regions that give the leading contributions from a single graph. Any 
economical method must treat exactly one region at a time, and involve only simple and 
very general relations to other regions. 

In addition, the methods used in the proof should be of use in algorithms for QCD 
calculations (both numeric and symbolic). Because of the many singularities in unintegrated 
graphs, it is important to have robust control over the sizes of the contributions from the 
different regions, and to have a very concrete definition of subtracted integrands, which are 
to be well-behaved numerically. The methods of proof in this paper, with their simple but 
strongly recursive structure, are appropriate for generalization for this purpose. 

In Sec. |1| are summarized some of the general ideas used in this paper. Then, in Sec. fT|, 
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we define the cross section that is to be treated and construct the basic elements that are 
used in the algorithm. The basic ingredient used to construct the factors in the factorization 
theorem is an approximation where the external lines of the hard-scattering are set on- 



shell. Precise definitions of this approximation are given in Sec. The definitions are 



arranged to be suitable for a Monte-Carlo event generator, where it is essential that parton 
4-momentum is exactly conserved; approximations that change the total 4-momentum of 
the scattering have to be used carefully. Before giving the general proofs, we show, in Sec. 
0, how the methods are applied to specific Feynman graphs. In Sec. a general proof 
of factorization is given in the form that it is needed to obtain an algorithm for a Monte- 
Carlo event generator. This section also includes a proof of the coUinear safety of the hard 
scattering and the integrated jet factors, as well as a discussion of the renormalization group 
properties of the factors in the factorization theorem. These are all essential to an actual 
implementation of the algorithm. In Sec. [VI I| , we then obtain the Monte-Carlo algorithm; 
this is the culmination of the theoretical treatment. Sec. [VII1| shows examples of complete 
calculations of corrections to the hard-scattering, at NLO. Finally, some conclusions are 
presented in Sec. |X| and some of the difficulties that remain in extending the method to 
QCD are explained. 



II. MAIN FEATURES OF THE NEW METHOD 
A. Previous methods 

The methods used to treat higher-order corrections in the analytic methods, even within 
a Monte-Carlo approach [|^ , do not directly translate to methods for event generators. One 
of the reasons is the way in which approximations are made on the kinematics of events. 
Consider, for example, the production of two or three partons (in e"'"e~-annihilation). The 
lowest order graph for 2-parton production, shown in Fig. |l|, is assigned to an exactly back- 
to-back configuration. In the lowest order graph for 3-parton production. Fig. |], the 3 
partons are not back-to-back. However there is a singularity in the 3-parton calculation 
when pairs of the final-state particles approach each other and make a 2-jet configuration. 
This part of the cross-section is really a 2-jet process with an order contribution to the 
fragmentation of one of the jets. 

In the standard calculation of the hard-scattering coefficient for 3-parton configurations, 
a subtraction is made of a fragmentation factor times the 2-parton cross section, which is 
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localized in exactly the back-to-back situation. Let qx be a transverse momentum that pa- 
rameterizes the deviation from a back-to-back configuration. Then the resulting contribution 
to the hard-scattering coefficient for the 3-jet cross section is of the form: 



:6^'\qT), (2.1) 



plus some irrelevant non-singular terms. Without the -|- prescription, the cross section has a 
divergent integral over qt- Although the subtraction at = produces a valid, integrable 
distribution (or generalized function), it does not produce an ordinary function. To imple- 
ment a 3-jet cross section in an event generator one should have an ordinary function, not 
a distribution. 

A related difficulty is that after showering and hadronization of the final-state partons, the 
two graphs. Figs. ^ and ^ produce hadrons in overlapping regions: one cannot distinguish 
between an event that is derived from exactly back-to-back partons, as in Fig. |l|, and an event 
derived from an almost back-to-back configuration from Fig. 0. However, the distribution in 
Eq. ( |2.1| ) does make such a distinction. Moreover from the large qt tail of the hadronization 
of the 2-jet cross section one obtains final states with 3 (or more jets). 

The simplest solution to these problems is to compute an infra-red safe observable, like 
thrust. This is commonly done with jet production in e^e~ annihilation [0. But this is 
not suitable in an event generator, where exclusive final states are generated and arbitrary 
observables can be estimated. 

Another method is commonly used in hadron-hadron collisions, where the cross sections 
for all jet observables involve parton densities, so that purely perturbative calculations are 



never IR safe. This method is to convolute Eq. (|2.1| ) with some kind of smearing function, 
that intuitively is supposed to represent intrinsic transverse momentum of partons relative to 
corresponding hadrons. Without further justification, this is just an ad hoc prescription that 
is used in a attempt to model a physical phenomenon that is clearly present. Furthermore, 
some of the qt smearing is contained in higher-order corrections to the hard scattering. To 
implement a valid calculation, it is necessary to devise a subtraction method that is derived 
from the underlying field theory and that prevents double counting; in a sense this gives a 
correct method to implement qx smearing. 

One very different approach that has been used to implement NLO corrections in event 
generators is to reweight events after they have been generated, so that the hard scattering 
cross section is renormalized to the value known from analytic calculations. In addition, 
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FIG. 3: General leading-power contribution to cross section 

the unsubtracted NLO correction is used to populate regions of phase space not filled by 
showering the partons associated with the LO cross section. This method does not readily 
generalize. 

B. Overall structure of new method 

The basic idea of the new method is that the subtractions should be computed with the 
correct parton kinematics, rather than a zero-transverse-momentum approximation such as 
is implicit in the +-distribution in Eq. This means that we must always remember 

that graphs like Figs. |l| and |] do not fully represent the physics. The parton lines going into 
the final state should really be replaced by bubbles representing hadronization, as in Fig. 
which represents the most general leading region. 

Apart from such implementation issues, the basis of the method is the usual factorization 
theorem: As illustrated in Fig. ^, the leading power contributions involve a hard scattering 
convoluted with jets, each jet being induced by a single parton of relatively low virtuality. 
The algorithm for the event generator makes recursive use of the factorization theorem by 
applying it to each of the jets whenever the invariant mass of the parton initiating the jet 
is much larger than a typical hadronic mass. In this second application of factorization to 
the jets, the hard-scattering coefficients are equivalent to the DGLAP splitting kernels. 

Current event generators are based on a leading-order approximation which involves the 
use of the lowest order expansion of the hard-scattering and splitting kernels together with 
Sudakov form factors that encode the renormalization-group transformation between the 
potentially widely different scale in the different applications of factorization. 

What has been gained by the use of factorization is the ability to compute an arbitrarily 
large number of high-order Feynman graphs in a time that is merely proportional to the 
number of particles in the final state. The price of this is that approximations have been 
made, both as regards the kinematics of individual graphs and as regards the subset of 
graphs that is calculated. It is the proof of the factorization theorem that shows that the 
approximations are valid and useful. 

At each stage of the algorithm, there are errors due to higher-order corrections to the 
relevant hard scattering. The higher-order contributions to the cross section are computed 
by subtracting from the bare graphs contributing to the H subgraphs in Fig. |] the con- 
tributions at the same order that are allowed for in the lower-order calculation combined 
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with showering. To devise an algorithm one must: (a) define precisely what is being com- 
puted, (b) define precisely the kinematic approximations used in the hard scattering, and 
(c) implement the subtractions point-by-point in momentum space. The subtraction terms 
are computed with the same kinematics as the bare term, to satisfy the last requirement, 
while the kinematic approximations are only used in the computation of the numerical val- 
ues of the matrix elements, but not in defining the parton momenta. A similar treatment 
of initial-state showering will require us to employ parton densities differential in transverse 
momentum, or some equivalent concept. 

Once the method is formulated properly for the higher order corrections to the hard 
scattering, it is readily extended to the evolution kernels, as we will see. 

One potential problem is that, as is well known, subtraction methods tend to generate 
terms in the cross section that are negative. The reason is that if, as is possible, a next-to- 
leading (NLO) correction decreases the cross section compared with the leading-order (LO) 
cross section, then the NLO term is negative. The virtual corrections to the hard scattering 
can simply be added to the LO term; in the method proposed in this paper, these virtual 
corrections will always be collinear and infra-red safe, even in QCD — see the work of Collins 
and Hautmann p|, § — since the collinear and soft pieces are supposed to subtracted. Thus 
the virtual corrections cause no problems of positivity. 

As for the real NLO corrections, they are treated as generating a separate class of events. 
Where their contribution to the differential cross section is negative, these events must be 
generated with negative weight. This is often considered undesirable, but as explained in 
0, this is actually acceptable for most purposes, provided that only a reasonably small 
number of events with moderate negative weights is generated. Then the negative-weight 
events are outweighed by larger positive contributions to a binned cross section. What is 
not acceptable is to have the cross section be obtained as a cancellation of relatively much 
larger positive and negative contributions. An extreme example of this is given by Eq. ( p.ll ). 
There the distribution consists of an unsubtracted term, which integrates to 

positive infinity, and a delta function with an infinite negative coefficient. It is also essential 
that all regions of final-state momentum space that are populated by negative-weight NLO 
events are also populated by positive- weight LO events; thus the cancellations that give a 
positive physical cross section occur point-by-point rather than between neighboring bins. 
Both these conditions are fulfilled by my algorithm. 

In addition, the Monte-Carlo formalism contains some cut-off functions. When the higher- 
order corrections are included, there is a renormalization-group invariance under changes of 
the cut-off functions, and these can be adjusted to at least reduce the number of negative- 
weight events. 

III. DEFINITION OF PROBLEM 
A. Model for e+e annihilation 

Let us use 0^ theory in 6 space-time dimensions, which has the Lagrangian 

= \d<t^' - - 1<P'- (3.1) 

In addition, to mimic an interaction with a photon, let us suppose that there is a weakly 
interacting scalar field A, which will be called a photon, and that the interaction with the 
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field is proportional to Aj where the "current" j is the renormalized operator^ To 
emphasize the analogy that is intended with QCD, let us call the "parton" field. 

The process we work with is an analog of e~^e~ annihilation to hadrons. It will be 
necessary to consider various cross sections (including, for example, the total cross section, 
jet cross sections and completely exclusive cross sections), so to unify the treatment I define 
a weighted cross section by 

a[W] = Kj2W{f) I rfSe^''-^(0|j(x)|/)(/b(0)|0). (3.2) 

Here is the total incoming momentum, j{x) is the current operator defined above, and 
is a standard factor that depends only on the leptonic variables. The sum over / denotes 
an integral and sum over all final states ^, and W{f) is a weighting function that defines 
the particular cross section under consideration. All cross sections can be obtained if a\W] 
is known for all W . Let us work in the overall center-of-mass frame, = {Q, 0). 
Examples of the choice of the weighting function are: 

• The total cross section, given hj W = 1 for all final states. 

• The 2-jet cross section, for which W{f) = 1 for all final states containing exactly two 
jets (given some definition of a jet), and W{f) = otherwise. 

• A fully differential two particle cross section, for which W{f) is the appropriate 6 
function for two particle final states, and for which W{f) = for final states containing 
three or more particles. 



B. Leading-power contributions 

Our aim is to decompose the cross section into its completely exclusive components, and 
to obtain an algorithm for calculating them to the leading power in Q. Now, the usual 



power-counting theorems of Libby and Sterman |]T0|, which are formulated in the overall 
center-of-mass frame, show that the leading-power contributions to the cross section in our 
model theory all come from regions of the form symbolized in Fig. |^. Some lines, forming 
the hard subgraph H and its complex conjugate, are off shell by order Q^. The other lines 
form groups, or jets, of lines of approximately parallel momentum. These groups are labeled 
Ji . . . Jn, and to satisfy momentum conservation there must be at least two jets, N > 2. 

Unlike the case of QCD (or any other gauge theory), there are no soft subgraphs in our 
model 0^ theory, at the leading power. This fact alone considerably simplifies the analysis 
in our model compared with QCD. 

Individual Feynman graphs for the cross section can contribute in several different regions. 
For example, consider the cut graph, F, of Fig. ^, which has four particles in the final 
state, of momenta pi, p2, ps and p^. The graph has the following regions for leading twist 
contributions: 



^ The square bracket notation used in this context denotes a renormahzed operator. 

^ It is probably most useful to treat the sum and integral over final states as being over all states without 
regard to the constraints imposed by indistinguishability. Then the overcounting of physically identical 
states is to be compensated by inserting a suitable factor, which for our model theory is 1/Nf\, where Nf 
is the number of final-state particles. 
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FIG. 4: Graph with several leading regions 

1. pi and p2 are almost parallel, and and p4 are also almost parallel. 

2. p3 and p4 are almost parallel, but pi and p2 are at wide angle. 

3. pi and p2 are almost parallel, but and p4 are at wide angle. 

4. All the final-state momenta are at wide angles with respect to each other. 

There are of course contributions from intermediate regions, and these give the leading- 
logarithmic part of the cross section. The techniques of our proof will avoid any need to 
consider these intermediate regions explicitly. 

C. Summary of proof of factorization and application to event generator 

A summary of the proof of the factorization theorem in perturbation theory is as follows: 

1. Write the cross section as a sum over graphs F. 

2. Write each graph as a sum of terms Cb!^\ one for each of its leading regions R: 

F = ^ C'ij(F) + non-leading power of Q. (3.3) 

regions R 

3. Sum over all possibilities: 

final states / graphs F 

= ^ EE <^i?(r)W^(/) + non-leading power of Q. (3.4) 
/ r,i? 

Here K is the same overall factor that we used in Eq. ( |3.2| ). 

4. Now Cij(F) (to be defined later) is the product of a hard part and a set of jet factors, 
with these factors corresponding to particular contributions to the subgraphs H and 
Ji in Fig. ^ The sum over graphs and regions can be reformulated as independent 
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sums over the hard part and the jet subgraphs, and this gives the desired factorization 
theorem: 

N 

a[W] = KY,Y.^n X Y[Jj W{f) + non-leading power. (3.5) 

/ N j=l 

Here, the hard part is denoted by Hn, with a hat over the H. The hat indicates 
that the contributions to the hard part are not obtained simply as the contributions 
of appropriate Feynman graphs in a certain region of momentum space. Instead, as 
we will see, they are obtained from the relevant Feynman graphs with the aid of a 
subtraction procedure that prevents double counting. The factor A is a Jacobian, 
whose calculation we will see later; it relates the integration over the massless final- 
state momenta of Hn to the integration over the correct and off-shell parton momenta. 

5. It is convenient to adjust the normalizations to give a formula that has a convenient 
interpretation as a hard-scattering factor times jet factors that are normalized to be 
probabilities — see Eq. ( |6.45| ) below. 

In a Monte-Carlo event generator, all integrals over final states are done by a Monte-Carlo 
technique. After generating the distribution of partons associated with the hard scattering, 
the distribution over invariant mass for each jet-initiating parton Ij is computed. Because 
of the collinear safety of total cross sections, this is a perturbative calculation when the 
virtuality of Ij is large. Then a further application of the factorization theorem is made 
to each jet, whenever the invariant mass of the initiating parton of the jet is large enough 
for the use of perturbative methods. This procedure is applied recursively until all the 
generated partons have low invariant mass. After that a hadronization model is applied, so 
that a hadronic final state is generated. Finally, the weight function W{f) may be applied, 
to define a particular cross section. Since the weight function is only applied at the last 
stage, all the previous steps can be implemented independently of the cross section being 
computed. Hence the events can in fact be generated without regard to the weight function; 
the weight function simply provides a convenient tool to unify all cross section calculations 
into a single formula. 

D. Comments 

The proof is given only in perturbation theory, but with a treatment of all orders at the 
leading power. Nevertheless the structure that is derived is evidently more general. Thus 
one can conjecture that a better and less perturbative treatment is possible. In particular, 
it is very natural that the structure of Fig. ^ should hold beyond perturbation theory. 

Factorization is needed so that the structure of the algorithm for generating events is 
simple. One important feature is that the hard scattering contribution from each region is 
computed in the approximation that its external lines have zero virtuality. Thus the compu- 
tation of the hard-scattering can be done independently of the subsequent computation of 
the properties of the jets. This approximation is good up to the claimed power-law correc- 
tion when the virtualities of the jet-initiating partons are low. When the virtualities of these 
partons get large, the approximation worsens. But then, as we will see, the higher-order 
corrections to the hard scattering correctly handle this situation, and compensate the errors. 
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For the decomposition of the graphs into a sum over regions, phase-space shcing is the 
most obvious method. However it does not produce the desired result, since the errors are 
fairly hard to control, and the errors tend to be quite non-optimal'^. This will lead us to use 
a more formal subtraction method. 

Essential requirements on its implementation appear to be: 

• Each particular subgraph for H and Jj gives the same result in all contexts, indepen- 
dently of the graph F from which the subgraph arises. This allows the manipulations 
between Eqs. ( p.4|) and ( |3.5|) to work. 

• The space of final states factorizes into parts associated with the partonic final states 
of the hard scattering and parts associated with the final states of each jet. Then 
factorization for the cross section follows from factorization for the amplitudes in each 
region. Again this is needed so that the manipulations between Eqs. ( p.4| ) and ( p.5| ) 
work. 

• The quantity Cr{T) that defines the contribution of a graph F in a region R must be 
an ordinary function, not a singular distribution. Then the integral over final states 
can be implemented by a Monte-Carlo method. 

E. Errors 

It is a little complicated to explain what are the errors on the approximations we make. 
This is an important issue to get correct, because if an event generator is used to make 
predictions that are compared with experiment, we need to understand the significance of 
deviations between the data and the predictions. 

First observe that the amplitudes are treated with the correct final states but with an 
approximation on the values of the amplitudes. The power counting theorems of Libby and 
Sterman show that the total cross-section is computed up to an error that is suppressed 
by a power of Q. Moreover this cross section is correctly decomposed into its exclusive 
components. So the obvious expectation is that the differential Monte-Carlo cross section 
differs from the exact cross section by a power suppressed error: 

c^cxactfW^] - crcaicuiated[W^] = o-Qxact [W^] X 0{m/QY (iucorrcct estimate). (3.6) 

Here m is a typical hadronic mass scale, and p is some positive number. For most final 
states, this estimate is indeed correct, since the approximations used are good to power-law 
accuracy in each leading region. 

However, there are other, non-leading regions, and for the theories under consideration 
this means regions that are not of the form of Fig. ^. The approximations we made are not 
valid in these regions, so a substantial deviation between theory and experiment can occur 
in a non-leading region. On the other hand, a non-leading region gives a power-suppressed 
contribution to the cross section; that is exactly what it means to say that the region is 
non-leading. So typically we will have small contributions to the cross section from such a 
region. 



^ The word "error" in this context means the difference between the exact value of a graph and the 
approximation used in the sum over regions. 
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Therefore we need to modify the error estimate. The following is one possibility: 

'^exact 

[W] 

^calculated 

\W] = (Ttotai X 0{m/QY (weak estimate). (3.7) 

This says that the error is power suppressed with respect to the total cross section. Unlike 
the previous estimate, it is excessively weak, since the non-leading regions will populate only 
certain regions of the space of final states. The simplest correct estimate is 

o"exact[W^] - c^caicuiatcdlW^] = cTcaicuiatcd [W"] X 0{m/Qf + terms for nou-leadiug regions. (3.8) 

This says that the errors come from two sources: power-suppressed errors in the approxima- 
tions, and contributions from non-leading regions. It leaves open the issue of exactly which 
hadronic final states are populated by the non-leading partonic regions. 

An example of the issues can be seen with the aid of the graph of Fig. |^. The only leading 
region for this graph is the wide angle region, where the partonic final state makes 3 jets. 
It also has non-leading regions. Among these are 2-jet regions, where 2 partons are close in 
angle. Only one propagator has a small denominator, so the graph's contribution is much 
smaller than the contribution of graphs like those of Fig. ^ where two denominators are 
small. Since there are other contributions to the cross section that are bigger, we see that 
the graph makes a power-suppressed contribution to the 2-jet region. In addition, there is 
a region where parton p2 is soft, i.e., has all its momentum components much less than Q. 
This region would be leading in QCD, if the soft parton were a gluon, but the region is 
non-leading in a non-gauge theory. However, the kinematic region it gives in the final state 
is not necessarily populated by the leading regions, so this case gives a non-trivial example 
of the last term on the right-hand side of Eq. (|3.8|) . 



IV. KINEMATICS OF HARD SCATTERING 

The core of the proof of factorization is a treatment of the sum over leading regions for 
an individual graph in Eq. (|3.3|) . Each term Cji{r) is defined by an appropriate kinematic 
approximation and a subtraction procedure. Given this definition, one must prove the error 
estimate in Eq. ( |3.3| ). Various properties of the kinematic approximations and the subtrac- 
tions are needed to obtain the Monte-Carlo algorithm and the methods for the calculation 
of non-leading corrections. These properties ensure that the combinatorial part of the proof, 
between Eqs. ( p.4|) and ( |3.5| ), actually works. 
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In this section we define the approximation associated with a particular region, and derive 
its important properties. 



A. Approximation 

Consider a generic leading-power contribution from a particular graph for the cross sec- 
tion, as symbolized in Fig. |^. From it we will now construct an approximation that is suitable 
for obtaining a factorization theorem and a Monte-Carlo implementation. The main idea of 
the approximation is to replace the external momenta of the hard subgraph H by on-shell 
massless momenta, and also to set the masses of the internal propagators of H to zero. Since 
the approximation will be integrated to a region outside of its domain of validity, it will be 
necessary to give precise definitions of the transformation from the exact momenta to the 
massless approximated momenta and to compute the Jacobian of the transformation. 

Let V\yV] denote the particular graph that is being computed, and let it be decomposed 
into subgraphs corresponding to a particular region R of the form of Fig. |^, with being 
the hard subgraph and with Jri . . . J^^ being the jet subgraphs:^ 

T[W] = I dL{p;q,m) Hnil)l[Jnjipj)Wif). (4.1) 

j 

Here the momenta are defined as follows: 

• The final-state particles of jet subgraph J^j are pj i, . . . , Pj^n^y where Uj is the number 
of final state lines of J^j. 

• Then pj, with one subscript, denotes the collection of these rij momenta, i.e., Pj = 
(pj^i, ■ ■ ■ }Pj,nj^- The sum of these momenta is Ij = J2i=iPj,i, which is the momentum 
of the line joining Jjij to the hard subgraph if^. 

• The symbol p, with no subscripts denotes the collection of all momenta of the final 
state particles. 

• The symbol /, with no subscript, denotes the collection of the l/s, i.e., = 

. . . , l'^^ . The sum of these momenta is the photon momentum: Pj,i = J2j Ij = 

The symbol dL{p\ q, m) denotes the measure for integration over the Lorentz-invariant phase 
space: 

dLip; g, m) = n ./.f^'l i^^f ^^E P.. - <i)^ (4-2) 
with the argument m of dL denoting the mass of each of the final-state particles. 



^ Note that the symbols Hn and Jpi,j are not to be identified with the corresponding symbols in Eq. ( |3.5[ ), 
although they have related meanings. In that equation, Hj^ and Jj mean the fully subtracted contributions 
to the the hard part and the jet factors, with sums over all contributing graphs. In contrast, Hr and Jaj 
in the current section mean the values of subgraphs of a particular graph F, with the subgraphs being the 
hard and jet subgraphs associated with a particular region R. 
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In Eq. ( |4.1| ), Hr{1) denotes the hard factor, i.e., the expression for the product of the two 
subgraphs labeled H in Fig. ^ on each side of the final-state cut. The hard subgraph is to 
be amputated, i.e., it is without any factors for its external lines, and it is to be integrated 
over all its internal loop momenta. 

Similarly each JRj{pj) denotes the corresponding jet factor in Fig. ^. All its external line 
factors are to be included. It is also to be integrated over all its internal loop momenta. 

It is important to the derivation of a factorization theorem that the integration over 
final-state phase space in Eq. ( |4.1| ) can itself be factorized: 




(27r)52y'/| + M| 
dL{l-q,M) {Y{dL{p,-l,,m)'j . (4.3) 

In the first and second lines the phase space is decomposed into separate phase-space inte- 
grals dL{pj] Ij) for the final-state momenta of each jet. Then in the third and fourth lines, 
the integration over the /j's is decomposed according to the invariant mass of these mo- 
menta, so that the integration is presented as an integral over the momentum of a set of 
particles with masses Mi, . . . , M^r^. Notice that in the last line it was necessary to make 
explicit the mass arguments for the phase space of the jet momenta: dL{l] q, M). 

We define the approximation appropriate to the region i? as a replacement symbolized 
by an operation Tr: 



T[W] ^ TnT[W] = J dL{i;q,0) HR{i,m-^0) 



X 



r dM" r 

n [y dL{pf,l„m)Q{Mf/f,l)Jn,{p, 



Wif). (4.4) 



This approximation is to be thought of as a kind of Taylor expansion of the hard part 
Hr. The hard subgraph and the associated phase-space integrations have been replaced 
by massless approximations, and / is used to denote the massless approximated momenta 
that correspond to the incoming momenta of the jets. A precise definition of the relation 
between the approximated and unapproximated momenta will be given below. Note that the 
unapproximated jet momenta still have their physical non-zero invariant masses: /| = Mj. 

In a region where the final-state lines of the subgraphs Jrj do in fact form jets, TrF pro- 
vides a good approximation to the original graph F, up to power-law corrections. However, 
we will need to use TrF outside the domain of validity of the approximation, and then the 
integrals over the jet masses Mj are no longer restricted by momentum conservation. It is 
therefore necessary to introduce a cut-off function 6(iW|//i|,) for large jet masses. Here fip 
is the so-called "factorization scale", which in principle is arbitrary. The function Q{x) must 
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equal one for small values of x and zero for large values of x, and for this an ordinary 6 func- 
tion is suitable^: 0(x) = ^(1 —x). However a smoother function would also be appropriate, 
and is likely to be better for numerical work 0. 

The factorization scale determines the apportioning of the cross section between LO 
and NLO (and higher) corrections. As usual there is a renormalization-group invariance 
of the complete cross section under change of fip'- When all orders of perturbation theory 
are included, the factorized cross section is accurate up to power-law corrections. However 
truncation of a perturbation expansion produces errors of the order of the next higher power 
of Og, and an inappropriate choice of fip will produce logarithmically large perturbative 
corrections. For the hard scattering, fip should therefore be around Q, or a bit smaller, if 
the choice Q{x) = 6{1 — x) is made. This is because Q sets the typical scale for virtualities 
in a hard scattering. A change in the functional form of B simply corresponds to a more 
general renormalization-group transformation. 

It may be useful to apply a constraint on fip so that the momenta automatically avoid 
violating the constraints given by momentum conservation. For this we choose hf so that 
6(M^//i|.) is zero whenever M > Q/Nji, where Nji is the number of jet subgraphs for the 
region R. 



B. Definition of projection onto massless momenta 



The main element of the approximation Tijr[Ty] in Eq. ( [4.4|) is the replacement of the 



external momenta of Hr by a suitable massless and on-shell approximation; let us notate 
this by 

i=PN{l). (4.5) 

The operator P/v is a kind of (non-linear) projection, and the definition^ that I choose is 
that in the overall center-of-mass frame 



ij , (4.6) 



with 

A = ^|Z,|/Q. (4.7) 

j 

That is, the spatial momenta are simply scaled by a common factor A so that the total 
spatial momentum remains zero. The scaling factor A is chosen to give the correct total 
energy, so that the total energy-momentum is preserved: 

E^^i =E^i = W'O)- (4-8) 

j j 

The physical requirement of conservation of energy and momentum has the mathematical 
consequence that the delta function 5^^\Y,j ~ q) in the phase-space integral remains un- 
changed, apart from a Jacobian factor, after the jet momenta Ij are replaced by their massless 



^ Hence the notation! 

^ It is likely that other definitions are possible and even useful. However, that issue will not be explored 
here. 
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approximations Ij. Note also that the operator P/v satisfies the relation P/v(P/v(/)) = P/v(/) 
characteristic of a projection. 

The derivation of the factorization theorem will rely on the fact that the transformation 
/ I— i> / has a smooth limit when some or any of the masses of the momenta go to zero. This 
enables us to use an expansion in powers of masses relative to Q at appropriate places. 

The inverse transformation from I to I will be used in the Monte-Carlo algorithm. It is 
given by 

= ^Jl] + Ml (4.9) 

with A now being the solution of 

5:yA%' + M| = Q. (4.10) 

j 

Notice that this last equation has a solution if and only if < Q-, and that then the 

solution is unique, with < A < 1. 



C. Jacobian 

As we will see in later sections, a contribution to a hard scattering coefficient contains the 
value of an actual Feynman graph together with subtractions that prevent double counting 
of lower-order contributions. Internal lines in the original graph will have their exact energies 
and momenta, while the subtraction terms will have approximated energies and momenta. 
However, in a calculation of the hard scattering coefficient we must use the same phase- 
space measure in both the original graph and the subtractions. Therefore we will need the 
Jacobian of the transformation between the exact and the approximated parton momenta: 

dL{i;q,0) = dL{l;q,M)A{l), (4.11) 

so that, for example, Eq. ([4. 4]) can be written as 



r[W] ^ TnT[W]^Y.^i^) HR{i,m = 0)l[[e{P^/f,l)Jn,{p,)]w{f), (4.12) 

/ J 

where the sum over / means an integral over all final-states with the correct phase-space 
measure dL{l; q, M), just as in Eq. (|3.2| ). 

Some reasonably straightforward manipulations show that the Jacobian is 

Here again Nji is the number of jet subgraphs for the region in question, and A is defined by 
Eq. ([4.7|). In a general number of space-time dimensions D, the formula remains the same. 



except that the factor of A^ is replaced by 
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In the massless limit, the Jacobian approaches unity: 

A(/) = 1 + 0(MVQ') 



(4.15) 



where the correction term contains a term for each of the masses. 

In the case of a region with two jet subgraphs, Nr = 2, these formulae simplify, and we 

have 

2|/i| 



A(2 jets) 



Q \ 



(Ml + M2) 



(Ml - M2)2 



and 



A(2 jets) 



2|/i| 



(Ml + M2 



N21-3/2 r 



1 - 



(Ml - M2) 



-3/2 



(4.16) 



(4.17) 



D. Jet factors and factorization 

With the above definitions, we can rewrite the factorization formula Eq. (|3.5| ) in terms 
of phase-space integrals for massless partons and of factorized integrals for each jet. 

a[W]=Kj2 dL{i-q,0) H^ll 

N=2 j 

The use of the approximated phase-space measure, dL{l;q,0), for the external lines of the 
hard scattering is important, since it keeps the algorithm in the event generator simple: the 
generation of the hard scattering part of the event can be performed without knowledge of 
the subsequent showering and hadronization of the external partons. 



rfM/ 
2n 



dL{p,-i,)e{i]/ij^l)j,{p,) 



W{f). (4.18) 



V. EXAMPLES 

In this section, I will show how the methods used in the proof of the factorization theorem 
are applied to particular examples of Feynman graphs. This will provide detailed motivation 
for an appropriate construction of the contribution for each particular region of momentum 
space, and in particular for the form of the subtraction terms. 



A. Need for subtractions 



A initial attempt to obtain a factorization theorem goes as follows: First, sum Eq. ( |4.4| ) 
over all possible jet configurations. Then, for each configuration, sum over all possible graphs 
compatible with the configuration. As follows from the argument in Sec. [Ill C| , the result 
would appear to have the factorized form given in Eq. ( |375| ) or ( [4.18[ ), provided that we 
define the contribution associated with region R to be 



CijF (region method) = TrF 



(5.1) 

restricted to region R 



We would then define the hard scattering coefficient if a? to be simply the sum over all graphs 
that make N massless outgoing partons, but with a restriction on the momenta to be in the 
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appropriate hard region. There is a corresponding restriction to coUinear configurations in 
the jet factor. 

Although this simple phase-space-slicing argument contains a core of truth, it fails to be 
exactly correct, for several interrelated reasons: 

• The boundaries of the various regions are hard to define precisely for the general case, 
and most importantly the regions are liable not to factorize into hard and coUinear 
parts. 

• If one does require a factorized form for the shapes of the regions in momentum space, 
it is possible that there can be double counting of some parts of momentum space, 
and omissions of other parts. It is quite unobvious how to automatically avoid such 
double- and/or non-counting in general, with a phase-space-slicing method. 

• A particular jet subgraph may correctly lie just below a threshold defining a jet, but 
some lines in the hard part may have invariant masses only just above the threshold. 
Then it is far from clear that the massless approximation for the external lines of 
the hard part results in a power-suppressed error. So there is a danger that the 
factorization theorem proved with the phase-space-slicing method fails to accomplish 
its purpose of providing an approximation to the cross section that is valid to the 
whole leading power. 

Instead, we will use a subtractive method. We will still construct the asymptotic form 
of the cross section as a sum over terms, one for each region of each graph, as in Eqs. ( p.3| ) 
and ( p.4| ). Each term Cr^T) will be exactly a contribution to the hard scattering times 
a contribution to the jet factors, and it will also equal the basic approximation T^F plus 
subtraction terms that cancel the effects of double counting between regions. However, 
the use of a subtraction method allows the integrals over momentum to be unrestricted. 
The sum over all graphs and regions then gives the factorization formula, Eq. ( [3.51 ). The 
combinatoric argument is the same as in the naive phase-space slicing method, but now 
there is no restriction on the integrations, other than from the cut-off function in the 
definition of the jet factor — see Eq. (|4.18 ) and Eq. ( |6.42 ) below. 



B. Simple example 

In this section, I will implement the subtraction method for the simplest example, the 
cut graph. A, of Fig. |^, whose final state has three particles, of momenta pi, p2 and p^. 1 
will give a lot of explicit detail for what is actually a fairly simple problem. The reason is 
that the techniques will generalize to much more complicated situations, but only a precise 
treatment will make this generalization clear. 

The graph has the following regions for its leading twist contributions: 

Ri. pi and p2 are almost parallel. 

R2: All the final-state momenta are at wide angles with respect to each other. This is a 
purely ultra-violet region where the graph can be correctly approximated by setting 
all masses to zero. 
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FIG. 6: First leading region of Fig. and its standard approximation. The vertical bars with the 
squared ends indicate where the collinear approximation of Eq. ([4.4[) is made. 



We will see how, in the leading power of Q, T can be written as a sum of a term for each 
of the two regions, plus a suppressed remainder term: 



2 ^ 

m 



A = A, + A2 + 0l\A\ — \. (5.2) 



Here, each of Ai and A2 is a particular term on the right-hand side of Eq. (|3.3| ), with Aj 
meaning Cr. (A). 

Region Ri In this region, a good approximation, symbolized in Fig. ^, is 

Ai =Tr^T = dL{ka,kb;q,0) HAi{ka,h) dDi2{pi,P2), (5.3) 



where I have used the definition, Eq. ( [4.4|) , for the Taylor expansion operator, Tr^. 

In this approximation the two external momenta of the hard subgraph are replaced by 
on-shell massless momenta: 

K = T = 5(1, -Ps/IpsI), (5.4) 

K = f (1, Ps/IpsI). (5.5) 

The scaling parameter A^i that appears in the transformation between the exact jet momenta 
and the massless approximation has the value 

A.. ^ (5.6) 

In the projection notation, {ka, k^) = P2{pi + P2, Ps)- 

The hard scattering factor in Eq. ( |5.3|) is simply unity, which is the value of the lowest 
order graph. The jet factor is defined by a suitable specialization of the factor in square 
brackets in Eq. (|4.18| ): 
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Here, I12 = Pi + P2 is the exact momentum of the intermediate parton. The errors in the 
approximation are solely in the massless approximation to the two-particle phase space, so 
that we can write 

A = Ai\l + 0(MiVg2)] . (5.8) 



Here M12 is the invariant mass of the final state in the jet subgraph, i.e., yipi +^2)^- 

When we construct the subtractions for the second region, we will need the formula for 
the approximation in terms of the original phase-space integration: 



Ai = dL{p; g, m) Aai Q{{pi + P2) V/^l) 77 — ^ . (5-9) 



[{pi + P2y - 

where the Jacobian is 



from Eq. ( p^ ). 

Region R2 In this region, all the external particles are at wide angle, so that the appro- 
priate approximation to the whole graph is 

Tr^A = dL{l2,a, h,b, k,c; q, 0) HA2{l2,a, h,b, h,c)- (5.11) 

The hard subgraph Ha2 is the whole graph, taken in the massless approximation: 

Ha2 = .,.2 , (5-12) 



and its external momenta are: 



'i. - . (5.13) 

1 5 

I 5 

Aa2 



IL - . (5.14) 



IL - . (5.15) 



where 



\a2 = 7; = l + 0[m^/Q''). (5.16) 



Q 

Now we note two related facts 

• Ha2 has a collinear singularity when ^2,0 and /2,fe become parallel. 

• Term Ai, which I have constructed to be a good approximation in region also 
contributes in the larger region R2. 
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So I define A2 by approximating the original graph minus the approximation 1 have 
already constructed for the smaller region Ri. The simplest definition would be A2 = 
Tff^{A — TrjA). However a small modification turns out to be convenient, and 1 define 



A2 = V{M^2/m)TnM-TR,A) 



= dL{p-q) V{M,2/m) Aa2 ^] ,,.2 - mla + kb)V f^l) Aai'J • (5.17) 

[{l2,a + kfirl ^ ^ 

Here Aa2 is the Jacobian for the transformation (^1,^2,^3) ^ il2,a,h,b,h,c), which is equal 
to unity plus corrections of order m?/Q'^. The Jacobian A^i' in the subtraction term is 
the approximation to A^i in which the momenta pi, etc., are replaced by their massless 
approximations /2,a, etc., i.e., 

Aai' = = ^Ai X [1 + O(mVg')] , (5.18) 

The remaining factor V"(Mi2,m) is what 1 will call a "collinear veto factor". Its purpose is 
to remove a spurious collinear divergence. After explaining the need for this factor, 1 will 
show that the sum of Ai and A2 provides a good approximation everywhere, Eq. ( p. 21 ). 

First, recall that Ai is obtained from A by the approximation of neglecting both the jet 
mass M12 and the particle masses with respect to Q. So the sources of error give terms of 
order M^^IQ"^ or rn? /Q"^. Since M12 is always larger than 2m, we can write 

A-A, = o{\A\^Y (5.19) 

We need to assume here that the factorization scale is of order Q; otherwise we would 
have a large logarithmic contribution to the total cross section. 

Next, we observe that A2 is obtained from A — Aihy applying a massless approximation 
for the external particles, as is appropriate when all^e particles are at wide angles. This 
approximation results in an approximated jet mass M12 that is different from M12. When 
the final-state particles are at wide angles, the exact and approximated masses are equal 
up to corrections of relative size w? jQ'^. But as the two particles become exactly paral- 
lel, the approximated jet mass goes to zero and produces a singularity in the propagator 
denominator, whereas the exact jet mass never goes below 2m. Thus, without the veto 
factor, A2 has a singularity in the collinear region. Indeed, when we replace the correct 
propagators 1/ [(pi +^2)^ — '^^j in the original graph A by their massless approximation 
1/ [(^2,a + ^2,6)^]^! we get a well-known logarithmic collinear divergence in the integral over 
all jet masses. 

However, the estimate Eq. ( |5.19| ) ensures that there is a power-law suppression: 

A,, = 




(5.20) 
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The estimate on the first hne results from the replacement of the exact propagator by the 
massless approximation, and is valid when M12 ^ m. The second line results from applying 
Eq. ( p.l9| ) in the massless approximation. 



Although we now see that the logarithmic divergence in the integrated cross section has 
been removed, there remains a divergence in the differential cross section. This can be quite 
annoying in practice, and it also complicates the proofs and erroi^ estimates. So we simply 
remove the divergence by defining A2 to include a veto factor V{Mi2/'m). One suitable veto 
factor is simply a theta function 

V{Mi2/m) = 9{Mi2 - 2m). (5.21) 

Any other function satisfying the following properties would be equally good: 

• V is zero when M12 is significantly less than m. 

• is unity when M12 is significantly greater than m. 

The first requirement ensures that the divergences are removed. Indeed the whole region 
when A2 is much larger than A — Ax is also removed, i.e., the region where a massless 
approximation to A — is a disaster. Since the purpose of A2 is to handle momentum 
configurations far from the collinear region, the precise functional form of V in the collinear 
region is irrelevant to physics. The second requirement, that V be unity away from the 
collinear region, ensures that A2 accomplishes its purpose of participating in a better ap- 
proximation to A in the non-collinear region, R2. _ 

The veto factor ensures that the approximated jet mass M12 is never much smaller than 
m, i.e., M12 ^ m. In this case, the true and approximated jet masses are comparable. 
Thus, when we set m to zero in calculating A2, we are neglecting m in comparison with the 
smallest virtuality, which is always of order Myi- Hence 



where we have used Eq. ( p. 19 ). This is exactly the desired result, Eq. 



Notice how the exact form of the veto factor does not affect this derivation. A change 
in the veto factor only affects the collinear region, M12 ~ m. Although the change might 
result in a 100% change in ^2, this does not affect the error estimate, since both A — A\ 
and A2 are inevitably of order m? /Q"^ (relative to A) in this region. 



C. Harder example 



In this section, we treat a more complicated example, the cut graph, F, of Fig. ^, which 
has four particles in the final state, of momenta pi, p2-, Ps and p^. As was observed earlier, 
the graph has the following regions for leading twist contributions: 

Ri: pi and p2 are almost parallel, and p^ and p^ are also almost parallel. 

R2: ps and p4 are almost parallel, but pi and p2 are at wide angle. 



Ry. pi and p2 are almost parallel, but ps and p^ are at wide angle. 
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FIG. 7: Relation between leading regions of Fig. ^. 




FIG. 8: Graph in cp'^ theory with overlapping ultra-violet divergences. 
R4: All the final-state momenta are at wide angles with respect to each other. 



Our treatment will start off being almost identical to that for Fig. 0, in Sec. |VB| . But 
complications will arise later because there is the equivalent of an overlapping divergence. 

As before, I will show how, in the leading power of Q, F can be written as the sum of a 
term for each of the four regions, plus a suppressed remainder term: 



F1 + F2 + F3 + F4 + O |F|— . (5.23) 




In the more general notation of Eq. (|3.3| ), we would have Cr-^T instead of Fi, etc. To organize 
the subtractions in these terms, we need an ordering relation between the regions, which is 
essentially set theoretic inclusion. For the graph under discussion, this relation is symbolized 
in Fig. 0, and I notate the corresponding orderings as Ri < R2, Ri < R3, R2 < R4 and 
R^ < R^i. This means that Ri is contained in both R2 and i?3, and that both R2 and R^ are 
contained in i?4. This same structure appears in simple examples of overlapping ultra-violet 
divergences. (For example it gives the relationships between the regions of UV divergence 
for the graph of Fig. |^.) Thus the example of Fig. ^ provides an interesting and non-trivial 
test of our methods. 

A general and precise definition of the ordering is made by associating each region with 
a particular set of momentum configurations of massless momenta, and then defining the 
ordering as simple set theoretic inclusion of these massless configurations. The configurations 
associated with a region R are those where the momenta within each jet are made massless 
and exactly parallel. These are exactly the configurations that are used to classify the 
pinch singularities of massless graphs in the Libby-Sterman analysis |TD[. For example, the 



configurations for Ri are all those for which pi and p2 are exactly parallel and massless and 
Ps and p4 are exactly parallel and massless. Similarly, the configurations for R2 are those 
for which ps and p4 are exactly parallel. Thus one set of momenta is contained in the other: 
Ri C R2, and I use this as the definition® of i?i < i?2- 



Note carefully the distinction between the massless coUinear configurations that are used to define relations 
like Ri < i?2 and the corresponding sets of momenta that give the important regions contributing to the 
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M12 



FIG. 9: Geometry of the leading regions for Fig. 





FIG. 10: First leading region of Fig. ^, and its standard approximation. 

In the case of the regions for our graph, a convenient visualization of the geometry is 
given in Fig. ^. This shows the plane of the two relevant variables, the mass M12 of the 
(12) pair and the mass M34 of the (34) pair. The singular surface associated with i?i is 



cross section. The massless collinear configurations form skeletons, so to speak, for the regions. Defining 
the relations R\ < R2 by set-theoretic inclusion for the skeletons is exactly literally correct. But the 
same definition applied to the regions requires careful interpretation to deal with the boundaries etc. 
(The concept of a massless collinear momentum configuration is precisely defined, but the concept of an 
important momentum region is rather fuzzy and not susceptible to a precise and useful mathematical 
definition.) 

A related issue is the commonly seen misunderstanding that the factorization theorem is concerned with 
the contributions of collinear (and soft) divergences. Literally speaking, this assertion is false, as is clear 
in any field theory that has non-zero masses for all its fields: these preclude there from being actual 
collinear and soft divergences, even though the factorization theorem remains valid. It is true that the 
leading contributions are associated with divergences. But the divergences are not necessarily literally 
present in the actual theory, and a correct treatment cannot assume this. The correct statement is that 
leading regions of momentum space can be labeled by the skeleton configurations that are surfaces of pinch 
singularities in massless Feynman graphs, and that the leading regions are contained in neighborhoods of 
the skeleton configurations. 
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the point M12 = M34 = 0; the surface associated with R2 is the hne M34 = 0, restricted to 
the physically accessible region; the surface associated with is the line M12 = 0, again 
restricted to the physically accessible region; and the ultra-violet region i?4 is the whole 
plane. Set theoretic relations between these sets of points evidently give Ri < R2, Ri < R3, 
R2 < Ri and i?3 < i?4. 

The construction of Eq. ( [5 .231 ) will use this ordering. I will start with the smallest region, 
and construct its term Fi to be a good approximation to F in this region. Then I will 
successively construct the terms for larger regions. Each term will be arranged so that, 
when added to the previously constructed terms, it does not violate the validity of the 
approximations provided by these terms. The first part of the treatment is therefore very 
close to that of the lower-order graph Fig. |^. 

From the Feynman rules of the theory, we see that the contribution of the unapproximated 
graph to the cross section is (up to overall kinematic factors): 

F = dL[p] q, m) ^ 2 . (5.24) 

Region R\ In this region, a good approximation, symbolized in Fig. |10|, is 

Fi = Tr.F = dL{ka,kb;q,0) Hi{ka,h) dDi2{pi,p2) dD34{p3,pi). (5.25) 

It is the obvious generalization of Ai, in Eq. (|5.3|) , and it is again constructed by replacing 
the external momenta of the hard subgraph by on-shell massless momenta, which are now 



K = Ai^(|pi+P2|, P1+P2) 

Pl±P2_\ 

K = kH\p3 + P4\, p-i+Pi 




(5.26) 



^ 1, -f^V (5.27) 

2 V \P2+P2\ ^ ' 



Here 



A, = i^±i^ . (5.28) 

In the projection notation, (/c^, kb) = ^2(^1 + P2, Ps + Pi)- 

The hard scattering factor Hi in Eq. ( |5.25| ) is simply unity, which is the value of the 
lowest order graph. The jet factors are defined by Eq. ( [5.7] ) for dDi2, and a corresponding 
formula for dD^^. The errors in the approximation are solely in the massless approximation 
to the phase space of the hard scattering. Therefore we have 

F = Fi [1 + 0{M^/Q^)] , (5.29) 

where Mi is the larger of the masses of the two pairs of particles: Mf = max(M^2) ^34)- 

In constructing subtractions for other regions, we will need the formula for the approxi- 
mation in terms of the original phase-space integration: 

. A, «;«<^'7-'V^) . (,30) 

[(pi + P2)^ - m^r [{P3 + Pi) - m \ 
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where the Jacobian is 

2|pi +P2 



= ( ^ ] , (5.31) 



from Eq. { ^ATj ). 

Observe that Fi differs from the original graph only by the Jacobian of the phase space 
and by the cut-offs for the jet factors. So in terms of methods for calculating individual 
low-order Feynman graphs, we have gained nothing; indeed we have lost something. What 
we have gained is a form that can be generalized to higher-order graphs and converted to a 
systematic algorithm for obtaining useful approximations to large numbers of graphs with 
a computational effort merely proportional to the number of final-state particles. 

Region R2 In this region, where only particles 3 and 4 are coUinear, the appropriate 



approximation to the whole graph is symbolized in Fig. 11 



Tr^T = dL{k2,a, h,b, h,c; q, 0) H2{k2,a, h,b, ^2,c) dD34{p3,p4). (5.32) 



The hard part H2 is (the square of) the left-hand subgraph of Fig. [11 

-.2 



whose external momenta are on-shell and massless: 



where 



Ka = A2'(|pi|, Pi), (5.34) 
^2,6 = ^2^(1^2!, P2), (5.35) 

^2,c = ^2^{\P3+P4\, P3+P4), (5.36) 



A2 = . (5.37) 
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Just as before, I define the contribution associated with region R2 by subtracting an 
allowance for the smaller region Ri: 

^2 = V2Tn,{T-r,) 
= V2TR,iT-TRj) 



dL{p;q) A2V2 



1 _ e((4, + k2,,f/fil) Ai 



Here 



The Jacobian for the transformation to A;2,a, ^2,6, and A;2,c is 

E, E2 E, + E, fpl ^ pI ^ {ps + p,f\ 1 



(5.38) 



(5.39) 



IPiI IP2I |P3 + P4I V^i ' E2 ' E3 + E4 J XlQ ' 
from Eq. (^J^). 

Ai' is Ai with the momenta pi + p2 and ps + p^ replaced by ^2,0 + ^2,6 and k2^c, i-e., 

3 



Ar 



Q 



2\h 



(5.40) 



■2,c 



The veto factor Vo is defined to be 



V2 = V{M^2/m) 



(5.41) 



where V is the same veto function as in Eq. ( ^.21|) . The reason for requiring a veto 
factor is the same as with Eq. ( ^.17| ). 

The formula for r2 is symbolized in Fig. It is power suppressed in the smaller region 
whereas the massless approximations to the individual terms F and Fi both have coUinear 
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singularities on the surface -Ri. The sum of the two terms I have generated so far gives a 
good approximation to the original graph in a neighborhood of R2 (and hence of the smaller 
surface whose defining surface is a subsurface of -R2): 

r = (Fi + 12) [1 + 0{MlJQ^)\ , (5.42) 

where M34 is the mass of the pair of particles 3 and 4. The proof of this equation is the 
same as for the corresponding equation Eq. ( p.2| ) for the simpler graph. 

A quick examination of Fig. ^ might suggest that the there is some kind of singularity on 
the line i?3, and that this would cause a problem in applying the above argument around the 
point Ri- This is not so. The treatment of the double-collinear region associated with Ri 
applies uniformly in a whole neighborhood of the point -Ri. The line R^ gains its significance 
only when one moves substantially away from Ri and finds a region where only pi and p2 
are collinear while the other momenta are at wide angles. 

There is an important transitivity property of successive projections of momenta. We 
obtained ka and k^, by a projection of the four momenta pi, ^2, and P4. Instead we could 
project the three momenta A;2,a; ^2,6, and A;2,c onto two momenta by 

K' = ^21 {\k2a + k2b\ , k2a + ^sb) , (5.43) 

K = A2/(|fc2,c|, fc2,e), (5.44) 

with 

\k2a + k2b\ + |fc2,c| 

A21 = g • (5.45) 

It is readily verified that k'^, = k^ and k^, = /c^. This is the transitivity property: one gets 
the same momenta by any appropriate sequence of projections. The proof in general is made 
by observing that each projection amounts to scaling all the relevant spatial momenta by a 
factor. The energies are defined by requiring the particles to be massless, and the scaling 
factor is defined by requiring the total energy always to be Q. 
Region R3 Just as with region R2, I define 

rs = V3Tn,{T-r,), (5.46) 

and I add it to the list of previously generated terms to obtain Fi + r2 + Fa. 

But now a new problem appears when we try to show that Fi + F2 + F3 is equal to F up 
to a non- leading term near both the singular surfaces R2 and R3. We have already seen that 
the sum of two of the terms, Fi + F2, is close to F near R2 — see Eq. (|5.42|) . Of course, it is 
also true that the sum of the other pair of terms, Fi + F3, is close to F near R^. But what 
we need is a result for the sum of all three terms. This extra complication is associated with 
the "overlapping divergence" structure in the relations between the regions, as symbolized 
in Figs. 1^ and ^ 

What we need to show is that the sum of the three terms is a good approximation in a 
neighborhood of both R2 and R3 (and of -Ri), i.e., 

F = (Fi + F2 + F3) [1 + 0(MVq2)] ^ (5_47) 

where now M is the minimum of: (a) the mass of the pair of particles 1 and 2, and (b) the 
mass of the pair of particles 3 and 4. 

The arguments I have used are sufficient provided that I prove the additional results 
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• r2 is power suppressed in region R^: it is ©(rMfj/Q^)- 

• Fs is power suppressed in region R2: it is OiTM^^/Q"^). 

These can be summarized by saying that each term is power-suppressed when we approach 
a region that overlaps the design region of the given term. 

Consider first whose definition is given in Fig. |T2|, or equivalently Eq. ( p. 3^ ). It is 



designed with region R2 in mind, which is the region where the lower two external lines (pa 
and Pi) are collinear. That is, it is concerned with a neighborhood of the line labeled R2 
in Fig. ^ Now F2 contains a subtraction which cancels the leading contribution from the 
smaller region Ri. That is, there is a power suppression, OlMf^/Q"^) relative to the size 
of F, when the upper two external lines become collinear. This suppression appears as one 
moves along the horizontal line in Fig. ^ 

Now observe that this suppression holds over the whole region of phase space, rather than 
just only around R2, the horizontal axis in Fig. |. The reason is simply that in the definition 
Fig. |12| of F2, the dependence on M34 is completely factored out. The cancellation that 
gives the 0{M^2/Q^) estimate works independently of the value of M34, i.e., independently 
of whether the (34) pair is collinear. To put it another way, the application of the T^^ 
operation has put the k2c line in Fig. ^ exactly massless and on-shell, so it is as if the (34) 
pair were exactly collinear. 

We therefore find that 

r2/r = 0(MiVq2). (5.48) 

Observe that when we integrate r2 over all final states, the power law suppression is worsened 
by a logarithm that arises from the integral over the mass of the (3,4) pair. 

The construction of F3 is precisely such that Fi + F3 gives a good approximation to F 
when (12) are collinear, i.e., when Mu is small. Hence F = (Fi + F2 + F3)[l + O^Mf^/Q'^)]. 

Exactly similarly, F3 is proved to be suppressed in region R2, and so F = (Fi + F2 + F3)[l + 
OlM^^/Q"^)]. We have two error estimates that are both valid, so the strongest provides the 
best estimate, which is Eq. ( |5.47| ). 

These new results, as well as the suppression of F2 and F3 in region Ri, are specific 
examples of a general result, to be proved later in full generality: 

For any graph F and any of its regions R, Cr(F) is power suppressed in neigh- 
borhoods of all regions R' except those for which R' > R. 

Thus Cji(T) gives a leading contribution in R itself, as it should, and in bigger regions. Its 
very construction ensures that it is suppressed in smaller regions. We have now seen in an 
example that it is also suppressed in other regions R' that merely intersect with R (or do 
not intersect it at all). 

Region R4 The contribution associated with the purely ultra-violet region can now be 
constructed as 

F4 = V, Tr, (F - Fi - F2 - F3) . (5.49) 
The veto factor must cut out both of the extreme collinear regions, and we can define it as 

Vi = V{Mi2/m) V{Mu/m), (5.50) 
where V is the elementary veto function defined in Eq. (|5.21|) . 
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Sum With the use of all the results I have derived, it is easy to show that the sum of all 
the terms Fi + r2 + + F4 gives a good approximation to F in all the regions, so that we 
obtain Eq. ( |5.23| ). Observe that all these cancellations work point by point in the momenta 
of the final-state particles. Thus they happen independently of the weighting function W, 
and thus independently of which cross section we wish to compute. This is in contrast to 
the "analytic" methods of calculation in perturbative QCD, which treat the subtractions 
as delta functions. In the analytic methods, the power suppression of the error only occurs 
after an integral over final states with a coUinear-safe weight function. 



VI. PROOF OF FACTORIZATION 

In this section, I will give the actual proof of the factorization theorem. The theorem is of 
a form that allows the construction of a corresponding Monte-Carlo algorithm in Sec. |VI1. 



As is usual in such proofs, the details can be tedious to work through unless one has a clear 
overall view of the ideas and aims. These are all based on experience with the examples 
presented earlier. 

So first, in Sec. | V 1 A|, I will give an overall view: One starts from the characterization of 



the possible regions that are important, and then defines corresponding contributions with 
subtractions to prevent double counting. These individual contributions are of such a form 
that summing over all possibilities gives a suitable factorization formula. 

The subsequent technical work is to show mathematically that the subtraction terms do 
indeed perform the job that they are constructed to perform. This is done by constructing 
suitable estimates of graphs and subgraphs before and after subtraction. 



A. Overall view 

As we have already seen, the leading regions for graphs are characterized by a decompo- 
sition into subgraphs: a hard subgraph where all the internal momenta are highly virtual, 
and two or more jet subgraphs. So for a leading region i? of a graph F, we can symbolize 
the decomposition as 

HrX Jm-- - X Jrnr, (6.1) 

where Nji is the number of jet subgraphs for the region. Analytically, the region is a 
neighborhood of a particular pinch-singular surface (PSS), as shown by Libby and Sterman 
| |1CI| |, and the decomposition into subgraphs corresponds to a factorization of the integrand. 

For each leading region we construct an approximation to the integrand by applying a 
suitable Taylor expansion to the hard subgraph: 

TrT = Tr{Hr) X J^i • ■ • X Jrn^, (6.2) 

Just as in the examples, the Taylor expansion is used to approximate the hard subgraph 
by a massless on-shell partonic quantity and hence to correctly approximate the momentum 
integrals by a suitable factorizable form. This is what enables convenient calculations to be 
done: after the summation over graphs, the hard subgraph can be calculated independently 
of the hadronization of the produced partons. 

As we saw, we must avoid double counting, so the contribution for leading region R is 
obtained by applying the above manipulations not to the original graph but to the original 
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graph minus the previously constructed terms for smaller leading regions. In addition, we 
found it convenient to insert a veto factor that excludes small neighborhoods of the massless 
collinear configurations of the Taylor-expanded hard subgraph. So we define 

Cr{T) = VRTn{T- Cu't]. (6.3) 

\ R'<R J 

This equation gives a recursive definition of the contribution Cr^T) for region R, since it 
involves only the same quantity for smaller regions. The recursion starts with the minimal 
region(s) -Rmin, i-e., those that contain no smaller regions. For these 

Cr . T = Tr . T. (6.4) 

Hence Eq. ( |6.3|) is a valid definition of Cr. 

The veto factor can be defined as a product of an elementary veto functions: 

VR = l[V{M,/m). (6.5) 

j 

Here the product is over all combinations of external lines of the hard subgraph of R, and 
Mj is the invariant mass of these lines in the massless approximation. The elementary veto 
function V is defined in Eq. ( ^.21| ). Although there is an arbitrariness in the choice of the 



veto factors, the precise choice will not affect the physics, as we have already observed. 

Now follows a very critical point. The double-counting subtractions are applied on smaller 
leading regions than R and thus to leading regions whose hard subgraph is strictly smaller 
than Hr. Moreover the Taylor expansions are always applied to the hard subgraphs, with 
the jet factors being left completely unaltered, as is appropriate for quantities that are 
ultimately to be regarded as unknown non-perturbative quantities. Hence the subtractions 
are applied inside the hard subgraph for R, and we can write 

C«(r) = Vr TrIhr - Cr,Hr] xJr,---x Jrn^. (6.6) 
V R'<R / 



The argument now falls into the general pattern already explained in Sec. [HI C[ and we 



simply need to explain what are the jet factors Jj and the hard factor Hj<4 in Eq. ( ^.5p . 
Their definitions follow from the definition of Cr{T). The jet factors Jj are (sums over) 
unapproximated jet subgraphs, while the hard factor is obtained by summing the hard- 
scattering factor in Eq. (|6.6|) over all the relevant graphs: 

Hn= VtIh- Y Cr^hV (6.7) 

graphs y R'<oo j 

The sum is over all (cut) graphs H for the cross section which have iV outgoing parton lines. 
The unsubscripted T denotes the Taylor operator applied to the whole graph, i.e., that the 
whole graph is treated as a hard graph, and similarly for the veto factor V . The sum over 
regions R! is over all non-singular regions of H. Thus f^Ar is the properly Taylor-expanded 
and subtracted hard subgraph with external parton lines. 

So far, we have merely made definitions that generalize the structures seen in examples. 
The factorized form on the right-hand-side of Eq. ( |3.5|) is a trivial consequence of the struc- 
ture of the Taylor expansion operator that we have already defined. The non-trivial part of 
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the proof of the factorization theorem is to prove that the remainder term is indeed power 
suppressed. What we need to show is that, to leading power, each individual graph is indeed 
the sum over the defined contributions for each of its leading regions: 

r = ^ Cr(T) + non-leading power. (6.8) 

R 

That is, we must derive an appropriate estimate for the non-leading-power term. To employ 
the factorization theorem in calculations, certain relatively simple subsidiary results will also 
be needed: 

1. Infra-red and collinear safety of the subtracted hard scattering factors H. 

2. Infra-red safety of the integrated ]et factors. 

3. Renormalization-group-like equations for the scale dependence of the hard-scattering 
factors and the jets factors. 



See Thm. ^ and Sees. |VTg and [VrE| . 



The actual proofs will be relatively short. What will take the bulk of the space will be a 
series of definitions that provide a precise language for describing the various leading regions, 
their relations and the accuracy of the approximations. The main elements are: 

1. The structure of the subtractions for each region. This is an obvious generalization 
from the examples, and the structure is common to many related problems, for example 
the construction of renormalization counterterms. 

2. A precise characterization of the regions to be discussed. 

3. A quantification of the errors. This needs a method of quantifying how close a point 
is to the defining surfaces of the various regions involved, which leads us to make a 
definition of the concept of a distance to a singular surface. 

4. The actual construction of the error estimates. This involves considerations of the 
relative sizes of the virtualities of different lines, which will be characterized by the 
distances to singular surfaces. 

One tricky point is that we will want to prove a suitable error estimate for a given graph 
given that the subtractions inside hard subgraphs actually work. The subtractions work if 
corresponding error estimates are available for the hard subgraphs. The simplest and most 
obvious proofs are made with the external lines of a graph on-shell. But when these same 
graphs occur as subgraphs of bigger graphs, we will in fact need the estimates for graphs 
with off-shell external lines, and the notion of distance to a singular surface needs to be 
carefully defined to allow for this possibility. 

There will also be two views of the leading momentum regions. One will be a geometrical 
and topological view, of the kind illustrated by Fig. ^ which is symbolic of a momentum 
space of quite large dimension. In this view the key ideas are the distances to singular 
surfaces and the relations between the singular surfaces defined by set theoretic inclusion 
and intersection. The second view relates these regions to Feynman graphs, the sizes of the 
hard and jet subgraphs for different regions, as symbolized in Fig. ^ The error estimates 
are given by relating virtualities of lines in Feynman graphs to the distances to singular 
surfaces. 
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B. General formula for single graph 

The structures seen in the examples suggest the general result. To make the argument 
self-contained and precise, I will give the results and proofs as a sequence of definitions and 
theorems. The first few recapitulate results from the previous sections. 

Theorem 1 The non-ultra-violet contributions to the cross section a[W] arise from neigh- 
borhoods of the pinch singular surfaces (PSS) for the massless theory. Moreover the leading- 
power contributions in our model correspond to the regions symbolized in Fig. |^. The corre- 
sponding PSS have groups of jet lines consisting of exactly parallel massless momenta. 



Proof: This was proved by Libby and Sterman |T0| in their fundamental work 



on the analysis of the leading regions. A more recent account of the power- 
counting is given in []TT|. 



This result applies to both real and virtual corrections to the process. In the 
case of loop diagrams, it is only necessary to consider regions where the (multi- 
dimensional) contour of integration is trapped near on-shell configurations for 
the virtual lines; otherwise the contour can be deformed. This accounts for the 
requirement that the singular surfaces correspond to a pinch. The momenta for 
external on-shell lines are, of course, necessarily pinched. 



Definition 2 The regions R for a graph V are specified by the corresponding PSS, with the 
inclusion of the limit points of the surfaces. In addition I define one other region, the purely 
ultra-violet region, whose associated surface is the surface where all the external lines of the 
graph are massless. I define a singular region to be one that corresponds to a non-trivial 
PSS; the only non-singular region is the ultra-violet region. 



This definition was explained and motivated by means of an example in Sec. |V (J|. Recall 



that it was convenient to code each region by a set of points that is not the region itself 
but a related surface of singularity in a massless theory. The above definition is simply the 
natural generalization of the example. 

One odd-looking feature is the definition of the surface associated with the purely ultra- 
violet region. For the example of the 4-parton graph of Fig. this was i?4, and in Sec. 
its associated surface was defined to be the whole of the space of final-state momenta. But 
now we will also want to be able to discuss this graph when it is the hard subgraph for 
a particular region of a bigger graph. We will be using a Taylor expansion operator that 
replaces the external momenta of the hard subgraph by massless momenta. Therefore it is 
more appropriate to consider the space of momenta to be the space of all momenta (on-shell 
or not), and then the surface associated with is most naturally the surface where all the 
external momenta of the graph are massless. This surface then has an obvious embedding 
into a bigger space of momenta when the graph is a subgraph of a bigger graph. The above 
definition simply generalizes this idea to all graphs. 

The inclusion of the limit points is an important technicality that lets us define the 
ordering of regions simply as set-theoretic inclusion. Physically the inclusion of limit points 
corresponds to including smaller regions in the definition of the defining surfaces. Initially, 
I found this to be contrary to my intuition. For example, in the case of Fig. ^ one might 
suppose the region R2 where ps and p4 are parallel should exclude the region where in 
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addition pi and p2 become parallel. The approximation to this region provided by applying 
certainly loses accuracy in the smaller region since it is no longer valid to neglect masses 
in the hard subgraph. However, the mathematics runs much more smoothly if one defines 
the surface that codes a particular region to include the surfaces that code each of the smaller 
regions. 

Definition 3 An ordering of the regions is defined by set-theoretic inclusion: Ri < R2 
means that the defining surface for Ri is strictly smaller than the defining surface for R2 . 

It is convenient to make explicit the concept of a leading region: 

Definition 4 A leading region is a region that gives a leading-power contribution to the 
cross section. A non-leading region is a neighborhood of a PSS that gives a non-leading- 
power contribution. 

Theorem 5 Feynman-graph interpretation of ordering for leading regions: Let Ri and R2 
be two leading regions for a graph T. Then Ri < R2 is equivalent to saying that the hard 
subgraph, Hi, for Ri is smaller than the hard subgraph, H2, for R2. Correspondingly, when 
momenta approach the smaller region Ri, some internal lines of H2 have propagators with 
small denominators 

Proof: This is an elementary consequence of the meaning of the collinear 
kinematics of the singular surfaces: If Ri is smaller than R2, then more lines of 
the graph have low virtuality in region Ri than in region R2. Thus fewer lines 
are far off-shell and the hard subgraph is smaller. 



Definition 6 The approximation associated with a region R is notated by Tj^T. In the case 
of a leading region, the definition of TrT was given in Sec. at Eq. ( K.^ ). Observe that 
TrF is defined to extract the first term in the Taylor expansion in the masses of the external 
lines of the hard subgraph for the region, and then to use massless phase space for these 
external lines. (So it is not a naive Taylor expansion.) For a non-leading region, we define 
TrT = 0. 



Theorem 7 Basic properties of TrF: The approximation T/jF for a leading region R has 
the form of a hard subgraph H{1), with massless external lines, times a product of jet factors, 
as in Eq. ( I^.^p or The approximation can be expressed in terms of the original phase- 

space measure, as in Eq. with the aid of a Jacobian factor. The Jacobian, Eq. (^.ISj , 

is analytic in a neighborhood of the surface R. 

Proof: These are immediate consequences of the definition, and all the neces- 



sary discussion is in Sees. IV A to IV C 



Definition 8 Let T be a general graph for the cross section. Then the contribution CuiT) 
associated with a region R is defined recursively by Eq. ^d.Sj) . This is the appropriate approx- 
imation applied to the original graph minus the contributions associated with smaller leading 
regions. (Observe that for a non-leading region C/j(F) = 0, so that only leading regions need 
be considered explicitly.) 
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It is convenient to define a related quantity, which is the original graph minus the con- 
tributions associated with smaller regions: 

Definition 9 

Cr{T)=T- Y: CniT). (6.9) 

R'<R 

A trivial consequence is that 

Cn{r) = Vn TuCRiT). (6.10) 

A related definition is: 

Definition 10 For a graph T, we let T be the graph with subtractions made for all of its 

leading singular regions: 

T = T- J2CrT. (6.11) 

J?<oo 

(Only leading regions contribute, since otherwise CrT = 0.) The notation R < oo means 
that the purely ultra-violet region is not included in the sum, i.e., that only singular regions 
are included in the sum. 

For example, the singular regions for the graph of Fig. ^ are Ri, i?2 and R^. In both of these 
two definitions, the bar over a quantity means that all subtractions are made except for the 
outermost one. 

The notation defined by Def. |rU| gives simple expressions for quantities we interested 
in. For example, applied to the hard subgraph for a particular region, it lets us write the 
contribution associated with the region as 

CrT = Y[Jr,VrTHr, (6.12) 

i 

where Hr is the hard subgraph for a leading region R, jRi are its jet subgraphs, and Hr is 
obtained from Hr by using Eq. (|6.11 ). 

It is perhaps worth reminding ourselves that THr is obtained from Hr by 

1. Setting the external lines massless and on-shell, and setting all the internal masses to 
zero, and 

2. Inserting a Jacobian factor corresponding to the transformation between the exact 
momenta and the approximated massless on-shell momenta. 

I will show that the sum of the contributions for the different leading regions gives the 
leading power behavior of F, so it is useful to make the following definitions: 

Definition 11 The asymptotic part ofV is 

KsyT = Y.CR{T). (6.13) 

R 

Here the sum over leading regions includes not only the regions corresponding to leading 
singularities of the massless graphs, but also the purely ultra-violet region, where all the 
external lines ofT are at wide angles. 
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The remainder is defined by any of the equivalent formulae 

Remr = r- Asyr = r-^c^(r) = {1-VT)T, (Q.U) 

R 

where the unsubscripted T in the last form denotes the Taylor operator applied to the whole 
graph, i.e., where the whole graph is treated as a hard graph, and similarly for the veto factor 
V. 

The main work of the rest of this section is to prove that the asymptotic part of F is indeed 
a good approximation to F, i.e., that F = AsyF x [1 + 0(m^/Q^)] plus power-suppressed 
terms from non-leading regions, or, equivalently, that the remainder is power-suppressed: 
RemF = F x 0(m^/Q^), again up to power-suppressed contributions from non- leading 
regions. 

The reason for making an explicit definition of Rem F is that it unifies the treatments of 
leading and non-leading regions: We will prove that in all regions Rem F is power suppressed. 

In the following theorems our aim will be to give estimates of the sizes of various quantities 
and of the errors in approximations, just as we did in specific examples. Estimates will be 
propagated through further manipulations. The actual integrands of Feynman graphs have 
an enormous range of sizes; for example near a collinear configuration some propagator 
denominators get small. This happens in such a way that an integral over the region gives 
merely a logarithmic enhancement in the cross section. To avoid having to explicitly state 
the sizes involved every time, it is convenient to use a notation ||7|| for a previously derived 
estimate of some graph or subtracted graph 7. An example of such an estimate is Eq. 
( p.l9| ), where for further manipulations in a general context, it would be useful to write 
II A — II = ||y4||M^2/Q^- The estimate of A, the unsubtracted graph itself, could simply 
be defined as the absolute value of A: \\A\\ = \A\. Note that these estimates are functions 
of all the relevant kinematic variables and that we will require the estimates to be positive 
definite. In a theory with spin, there could be "accidental" zeros in the numerator factors; 
such accidental zeros should be removed from the definition of ||7||. 

In addition it is convenient to define L{r) to be a value that is leading twist in all 
singular regions of F — it differs from ||F|| by being multiplied by a suitable dimensionless 
enhancement factor for each non-leading singular region. 

Error estimates such as these involve some notion of distance to a singular surface, and 
we must of course define what we mean by a distance. Since there is no metric on the space 
of momenta over which we integrate, the definition is not unique, but for the purposes of 
error estimation, the precise choice of definition will not matter. A suitable definition can 
be constructed by starting with a choice of coordinates: 

Definition 12 For each singular surface R, we choose "parallel" and "normal" variables. 
This means that we change the integration variables to have the form {k\\,k_i_), where k\\ is 
used to parameterize the surface R, and kj_ are normal variables. Thus the surface R is 
k± = 0, while the region corresponding to R is just a neighborhood of kj_ = 0. 

There is considerable freedom in the choice of these variables. One natural set for k\\ consists 
of: 

• The longitudinal momenta of the lines in the jet subgraphs, divided by Q. 

• The total energy of each jet, divided by Q. 
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• The angles between the external lines of the hard subgraph. 

These variables are arranged to be dimensionless and to have a range of values of order 
unity; this will aid the use of dimensional counting to compute the dependence on Q. 

The remaining variables, /c^, parameterize the deviation of momenta from the exactly 
singular configuration for the region. Given that we will wish to vary the virtualities of 
the external lines of a graph, an appropriate definition for the internal variables of a jet 
subgraph is given by the following steps: 

• As in Sec. let the total jet momentum for the subgraph be and let the final-state 
momenta be p^j. 

• Use light-front coordinates for the momenta in the jet, in the overall center-of-mass 
frame, with the z axis being chosen as the direction of the jet's total spatial momentum 
Ij. The + and — momenta of the final-state lines are = j ±p| j)/-\/2, and there 
are also four components of transverse momentum for each p^^. The axes are different 
for each jet subgraph. 

• The fractional -|- momenta, e.g., Pj'^/lj~, can be chosen to be among the parallel vari- 
ables for the singular surface R, as listed earlier, in place of the longitudinal momenta 
of the lines in the jet subgraph. 

• The — momenta and the transverse momenta form the normal variables. (This applies 
both to the final-state momenta of the jet and to any loops.) 

These variables have convenient properties under boosts along the jet direction. 

We will also need to consider non-leading regions. These differ from leading regions by 
the possibility of extra lines beyond the minimum of two joining a jet subgraph to the 
hard subgraph and by the possibility of a soft subgraph connecting to the jet and/or hard 
subgraphs — see [||, [l^. The above definitions apply equally to the hard and jet subgraphs 
of non-leading regions, and we need only supplement them by adding the soft momenta to 
the list of normal variables k±. 

To define the notion of distance, we make the following definition: 

Definition 13 We write the normal variables k±_ in terms of a single radial variable 
A and some "angular" variables k±. This can be done by writing kx = Xkx and 
k^ = X^k^/Q for each internal jet momentum, and k^ = \k^ for each soft momentum. 
Then we impose a suitable normalization condition on the scaled variables, for example, 
Z]jet iik'^T + \^i^t IQ\) + Z^soft i,ii\Koit = 1- Thus the normal variables k± have been 
expressed in terms of X, and all but one of the scaled momentum components^. The scal- 
ing, for reasons explained below, is different between the transverse and — components of jet 
momenta, and X is arranged to have the dimensions of mass. 



^ Observe the difference between a subscript T and a subscript _L. In this paper, the notation T will be 
used for the transverse components of ordinary Lorentz vectors, as usual, whereas the notation _L will 
only be used to indicate the very different concept of a normal variable associated with a region of the 
space of all the momenta of a graph. 
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Since there is no natural metric on a space of several momentum variables, this choice of 
variable is not unique. But the details do not matter. It is the scaling properties with 
respect to A and Q that will dominate our discussion. The normalization condition on the 
A;_i_ variables is not Lorentz invariant; this is satisfactory since the power-counting method 
and the derivation of the singular regions rely on the use of a preferred rest frame. 

The different scaling of the — and transverse components of jet momenta is chosen so that 
the square of a jet momentum fc^ = 2k^k~ — fc^ simply scales homogeneously, proportional 
to A^. This will be convenient in constructing error estimates that rely on the virtualities 
of the lines of a graph. Observe further that the scaled variables are dimensionless with a 
range of order unity, and that A ranges up to Q. 

For an example of the meaning of the variable A, consider the graph of Fig. There are 
14 variables of integration, which we split into 12 angular variables and the masses, M12 and 
M34, of two pairs of particles. The graph is independent of the 12 angles, so we only need 
to consider the two pair masses. 



In region Ri we can define Ai = \j + ^34- Since we have agreed to ignore the angles, 
there are no parallel variables for this region, and the normal variables consist of M12 and 
M34. 

In region i?2; "we can define A2 = M34, and this is the only normal variable. There is one 
parallel variable, M12/Q. 

For the general case we can write the integral over a graph 7 as 

j dk-f = Q^ j dk\\j dk^j y 7(A/m,A/Q,A;||,fcx). (6.15) 

Here we have taken advantage of the fact that the variables k\\ and k^ are dimensionless. The 
overall factor corresponds to the dimension of the integrated graph. The remaining part 
of the Jacobian for the transformation of variables has been absorbed into the integrand to 
give 7, which is dimensionless. The mass-shell delta functions for the final-state particles are 
treated as part of the integrand. The integrals over /c, k\\ and k± are all multidimensional, 
of course. 



The power-counting results of ||l0[ tell us how 7 behaves as Q — > 00 with A, fcy, and k± 
fixed. The leading power occurs for the regions specified by Fig. |^, for which 7 goes to a 
generally non-zero constant: 

Leading region: 7(A/m, A/Q, A^n, A;_l) ^ 7(A/m, 0, A;||, A;j_) 7^ 0. (6.16) 

All other regions give a negative power of Q: 



Non- leading region: 7(A/m, A/Q, /cii, A;_l) ~ — I , (6-17) 




where the power p is, for the case of our model, at least 2. 

Now A ranges from to order Q. When mass-shell constraints are allowed for, the 
minimum value is restricted to be of order m. When we perform the integral over all A 
in a leading region, a logarithm results from the region m -C A <^ Q, whose coefficient is 
obtained by neglecting the mass in 7: 

[Q dX 

Logarithm = / — 7(00, 0, /cy, /c^). (6.18) 

Jm A 



38 



Further logarithms result from the integrals over /cy and k±. 
For a non-leading region, the integral over A is roughly 

which is dominated by A ~ Q: the contribution from A ~ m is of order {m/QY. 

The primary operation in computing the asymptotics of a graph is the application of 
the Taylor expansion operator Tr for each region R. The definitions we have just made 
characterize this operator essentially as the extraction of the leading term in an expansion 
in inverse powers of Q, for large Q. The errors in the use of this expansion are associated 
with the neglect of A and m with respect to virtualities in the hard subgraph for the region R. 
For a generic value olk\\, the virtuahties in Hr are of order Q^, and the errors have relative 
size IQ^ . But when k\\ approaches values associated with smaller singular surfaces, some 
of the virtualities get smaller, and the errors get larger. To quantify this, it is convenient to 
define the distance of a point of momentum space from the defining surface of a region: 

Definition 14 Given a choice of normal and parallel variables for each R and of the overall 
scaling of the normal variables, we define the distance of a point k of momentum space to 
the region R as the value of X = X{k, R). 

To estimate the errors in our expansion, etc., we need to relate the distance to a singular 
surface to the virtualities of lines of a graph. 

Theorem 15 Let k be a point of the momentum space for a graph F. Then the smallest 
virtuality, Knin(fc); of the internal lines of T is at least of order the minimum of X^{k,R) 
taken over all singular regions R ofT. This means that the ratio minji^^ X'^{k, R)/Vaim{k) 
is bounded: it is less than a constant that is independent ofk. 

Proof: If we restricted our attention to leading singular regions, the proof 
would be quite simple, since in our theory, the leading regions correspond to 
groupings of external lines into jets. Then the distances to regions can simply 
be related, in order of magnitude, to the invariant masses of groups of external 
lines. 

But complications arise since we will actually need to handle momenta in 
non-leading regions, and for them the scalings vary for different momentum 
components, and the relation to invariant masses is not so clear. 

First, consider any particular singular region R. For generic values of k, the 
virtualities of the hard subgraph are of order and the virtualities of the jet 
lines arc of order X'^{k,R), if the region is a leading region, and hence without 
a soft subgraph. For non-leading regions there may be soft lines so that some 
jet lines get a virtuality of order QX. Hence for generic momenta at a distance 
A from the region, the virtualities of internal lines of the graph are at least A^. 

However, by varying k\\ and k± we can generally get lower virtualities, than 
the generic estimate, at a given value of A. 

Now fix k and find the singular region R^ that minimizes A. It may be that 
A; is a generic value for this region, in which case Knin(^) ~ X'^{k,R^) and the 
theorem is proved. We will need to prove that no other possibility arises. 
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So suppose the opposite, that Knin(fc) ^ A^(fc,-Rm)- Then some hnes of the 
graph have much lower virtuahties than is given by the generic estimate. Let us 
call these "anomalous lines" . Then we can make a small adjustment of k to put 
the anomalous lines exactly on-shell. But this corresponds to another singular 
region, Rn, say. Since some of the jet or soft lines of the original region Rm 
are in the hard subgraph of the new region, the new region is either larger or 
non-trivially overlaps with the previous one. 

Because the anomalous lines of Rm, i.e., those with especially low virtuality, 
consist of all the jet and soft lines of the new region, we will be able to show 
that the distance to the new region Rn is less than the distance to the previous 
regions, Rm, contrary to the definition of Rm- Indeed, if k is counted as a generic 
momentum configuration for the new region then Knin(fc) ~ which 
is just this result. If k is not generic, then we just repeat the above argument 
with Rm replaced by Rn- 



Theorem 16 Accuracy of approximation given by Tr for a leading region. Let 'j be a graph, 
and let \\'~f\\ represent an estimate of its size. Suppose k is a point which is a distance A from 
the defining surface of one of its leading regions R, and suppose that the minimum virtuality 
of the lines in the hard subgraph is Vff^min- Then 

7 - Tr7 = <^ \ VH,min ) (6.20) 

[ 0(||Tr7||) otherwise. 

The minimum virtuality VH,mm can be replaced by the square of the distance to the nearest 
singular surface smaller than R, if such a surface exists. Let us call this distance MB{k,R). 
If no smaller singular surface exists, or if the hard graph is a lowest-order tree graph so that 
it has no internal lines, then Mb is to be replaced by Q, and VH,mm by Q^. 

Proof: The whole point of the definition of Tr was to make a useful approx- 
imation to a graph valid as A — > 0, and this theorem formalizes how accurate 
this approximation is. 

Now Tr7 is obtained by replacing the hard subgraph for the region R by the 
first term in its expansion about the coUinear limit for its external lines. The 
jet subgraphs are completely unapproximated, so that they do not enter into 
the error estimate. The errors come from three sources: (a) the neglect of the 
external virtualities for the hard subgraph in comparison with the virtualities of 
its internal lines, (b) the neglect of internal masses in the hard subgraph, and 
(c) in the calculation of the massless phase space measure, the neglect of the 
external virtualities of the hard subgraph in comparison with Q^. 

The largest relative error comes from the internal lines of the hard subgraph 
that have the smallest virtualities; and these small internal virtualities arise 
precisely from neighborhoods of smaller regions than R. The error arises from 
neglecting both A^ and m^, so that we must insert the largest of these, i.e., 
max(A^,m^) in the numerator of Eq. ( |6.2(]| ). Since terms neglected in the prop- 
agator denominators have factors of external virtualities and mass-squared, and 
since the virtualities of the external lines are proportional to A^, the error term 
is quadratic, not linear, in A and m. 
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When the minimum virtuahty of the hnes of the hard subgraph decreases 
below or m^, the approximation to 7 given by Tr7 becomes totally inaccurate, 
and the error is simply estimated as 100% of the bigger of 7 and Tr7, which is 
normally the more singular quantity Tjfj. 

Thus we have proved Eq. ( 6.20 ). 

For the subsequent discussion, it is convenient to work in terms of the ge- 
ometry of the momentum integration space and hence in terms of distances to 
singular surfaces. Small virtuality lines in the hard subgraph are obtained by 
letting the parallel variables k\\ for the region R approach the defining surfaces 
for smaller singular surfaces. The surfaces are smaller, because the regions of 
momentum are more restricted: the external lines are already of low virtual- 
ity, and we are finding places where additional lines have low virtuality. Then 
essentially the same argument that led to Thm. |15| shows that the minimum 
virtuality can be replaced by the square of the distance to the nearest smaller 
singular surface. This quantity repeatedly appears later, so we give it a name 
Ml{k,R). 

Of course, if there are no smaller singular surfaces, and, in particular, if the 
hard subgraph is a lowest-order tree graph, so that it has no internal lines, then 
the approximations involve the neglect of and m? with respect to . In this 
case we simply define Mg to be Q^. 



Theorem 17 Accuracy of approximation given by Tr times veto factor. For any point k, 

7 - VnTnl = oih\\ "^^^^^''^') ^ . (6.21) 
7 VRiR-1 ^l^ll^lljnax(M|,A2,m2) i ^ ^ 



Proof: This is the same theorem as Thm. |1^, except that Tr7 has been 
multiplied by the veto function defined earlier, and that in the error estimate 
Tr7 has been replaced by 7. 

The veto factor multiplying T^7 ensures that whenever we get close to the 
massless coUinear singularities of Tr7, this quantity is replaced by zero. In that 
case 7 — VrTr7 is just equal to 7, and a correct error estimate is just 100% of 
7, instead of the commonly larger Tr7. 

The complicated looking denominator in Eq. (|6.21 ) is just to combine the 
two cases in Eq. (|6.20|). 



Definition 18 In Thm. 16, the quantity Mb played an important role. So it is useful to 



formalize it and to make some related definitions. We consider a graph T and a point k of 
its momentum space. Then 

• MB{k, R) is the minimum distance to the defining surface of regions R' that are smaller 
than R: Msik, R) = min/j/</{ X{k, R'). If there is no smaller region, then Mb is defined 
to he Q. 

• MB{k, r) (with the argument being the graph T instead of a region R) is the minimum 
distance to a singular region of T. Thus, MB{k, T) = mini{<oo A(/c, R). 
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Mext(fc, r) is the maximum of the square roots of the virtualities of the external lines 
ofT. 



The basic result that gives the error on our approximations is Thm. |16|. Its simplest 
application is to a graph 7 that has no singular regions. Then the only region is the ultra- 
violet region, and the difference between 7 and Tr7 only comes from the neglect of external 
virtualities with respect to in the calculation of the phase-space Jacobian. This is equiv- 
alent to the observation that the value of in Eq. ( |6.20|) is of order the maximum external 
virtuality, i.e., M^^^. If the graph has on-shell external lines, then the error is of relative 
order m'^/Q'^, and therefore RemF = 0(||r||m^/Q^). 

To prove this result for graphs with non-trivial leading singular regions, we will use 
an inductive (or recursive) approach, proving the desired result on the assumption that 
suitable results for the errors of smaller graphs have been derived. The necessary results are 
conveniently stated with the aid of the following definition: 



Definition 19 A graph T is well-behaved if 

T = 



\L{T) (6.22) 



for all momenta k whenever the external momenta ofT, which (for this definition) are not 
necessarily on shell, obey Mext{k,T) < MB{k,T). Here, as defined by Eq. ( \6.11\ ), T is the 
graph after subtractions for leading singular regions have been made. Also, as defined earlier, 
L(r) is a quantity like ||r|| modified so that it gives a leading contribution near every singular 
region. 

More informally, we are defining T to be well-behaved if the subtracted graph T is power 
suppressed near every singular surface. 

An equivalent formulation can be made in terms of the distances to individual singular 
regions. Thus T is well-behaved if 

r.o(L,r)^) (6.23) 

for all regions R and for all momenta k whenever the external momenta obey Mext{k, T) < 
MB{k,T). 

The idea of this definition is that F may have singular regions, where it is substantially 
enhanced, and that without subtractions the integrals near the leading singular regions have 
strong logarithmic enhancements, which diverge if m — >■ 0. The aim of the subtractions that 
are used to define F — see Eq. ( |6.11[) — is to provide a suppression in each of these regions. 



The above definition gives a criterion as to whether our definitions of the subtractions have 
succeeded in their aim, and so we will need to prove that all graphs are in fact well-behaved, 
in the sense of this definition — see Thms. |2l| and |2^. The ultimate results, including the 



factorization theorem will follow quite easily — see Thms. |20|, ^ and and Sees. [VI A| and 
IVm, etc. 



For our proofs to work, we have to consider non-leading regions as well as leading regions, 
and the above definition also applies to them. By definition, to say a region is non-leading 
means that there is a power suppression near the defining surface of the region. There is no 
need of subtractions for non-leading regions, of course. 



42 



The restriction to M^xtik, T) < Msik, T) is needed, because once the external momenta 
get further off-shell than is allowed by this restriction, it becomes unnecessary to ask how 
close the internal momenta k are to the singular surface; the external virtuality has already 
forced some of the internal lines to have at least comparable virtuality. 

The next theorem says that if all the leading singular regions of F have been properly 
subtracted, then a good approximation to the subtracted graph is obtained by applying a 
Taylor expansion to the dependence on its external momenta: 

Theorem 20 IfT is well behaved and Mcxt(fc, F) < MBik,T), then 

(1 - VrT)T = O (^L(r)^^fi^^^ (6.24) 



The left-hand-side is one of the equivalent forms of the definition o/RemF — see Eq. 



so this theorem is just stating that the remainder is power-suppressed (or, as is often said, 
it is of higher twist). 

Proof: T replaces the external momenta of F by on-shell massless momenta 
and replaces the internal masses m by zero. In addition it inserts a Jacobian 
factor for the transformation between massive and massless momenta. We now 



apply Thm. to F when the region R is the purely ultra-violet region for F, to 
obtain: 

Remr = (i-t.,r)r = o(||r||^|I)) (6.25) 

Here, we have used the fact that the distance to the defining surface of the 
ultra-violet region is of order M^xt- The desired Eq. ( p. 24 ) then follows from the 



meaning, Eq. ( |6.22| ), we gave to F being well-behaved. 
More intuitively, this proof can be summarized as 

• Away from all singular regions, the definition of T implies that a graph 
minus its Taylor expansion is suppressed by a factor of order Mg^^/Q^, 
which is already the desired result, Eq. ( |6.24|) . 

• As each singular region is approached, this suppression is worsened by a 
factor QVA2(fc,i?). 

• But because the hard subgraph of each region is well-behaved, by the hy- 
pothesis of the theorem, F is itself suppressed by a factor X^{k,R)/Q'^ rel- 
ative to F. 

• The last two factors cancel each other. 

Consideration of the worse case scenario for each value of k requires us to replace 
A2byM|(A;,F). 

Although this proof looks quite trivial, there is actually one point that is 
particularly non-trivial. This is in the application of Thm. That theorem 
applies most obviously to a graph or a function that is a simple product of 
propagators. But when the graph is a difference of relatively large quantities, 
the derivation requires more care. 

A simple mathematical example for these results is provided by setting 

- 1 . ^ 

(6.26) 



(m2 + Ml^) {m? + 2Ml^ + g2) 



where M12 is as an analog to a jet mass. The subtraction of the singular region 
M12 ^ Q is accomplished by writing 



r = r-TiF 

1 1 

~ (m2 + Mfa) (m2 + 2Mf2 + Q^) " {m^ + M^^) ' 

Here Ti sets m and M12 to zero in the l/{m'^ + 2Mf2 + Q^) factor. 

The analog to Eq. ( |6.24|) uses an operation T applied to F, and in this model 



it consists of setting m to zero: 

There are relative errors of order m? /MI2 in both terms separately. At first sight 
we only have 

which gives 0{Tm? / Ml2)- Since M12 may be much smaller than Q, this error 
may be much larger than the desired error, where the denominator in the error 
is Q^. However, the biggest parts of the errors are in fact identical, and this 
gives the correct result OiVm^ /Q"^). 

The general approach to this issue is to combine all the terms in F so that 
they are over a common denominator. Then the approximation is applied to 
each factor in turn, and hence the error is the relative error produced by the 
1 — T operation times the size of F. 



Theorem 21 Suppose all graphs smaller than F are well-behaved, in the sense of Def. 
so that F is itself well-behaved. 

Proof: The structure of the proof generalizes the treatment of the concrete 
examples in Sec. 0, and the proof will be illustrated with the aid of these exam- 
ples. 

If F is small enough that it has no singular regions, then Mb = Q and F = F. 
Then the definition of well-behavedness is trivially satisfied. 

If F does have singular region(s), let k be any point for F that satisfies 
Mext(/c, F) < Mb(/c,F), and let R be the nearest singular surface. We put the 
other singular surfaces R' into the following classes: 

(a) R' < R, 

(b) R' > R, 

(c) R' intersects R, but it is neither a subset nor a superset of R, 

(d) R' does not intersect R at all: RnR' = 
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Then 



Cr,t -CrT- J2 Cr,t 

R'<R R' in (b-d) 

-VRTR)CnT- J2 C'ijT, (6.30) 

R' in (b-d) 



where we used Eqs. ( |6.9| ) and ( |6.10| ). We need estimates for each of these terms. 



Note first that if is a non-leading region, then the subtraction term CrT 
for this region is zero, and thus that Tr is zero for a non-leading region. The 
necessary estimates for the terms corresponding to each class of R' are obtained 
as follows: 

(a) From Eq. (pIT^ ), 

(1 - VrTr)CrT = (jets of R) (1 - VrTr)Hr, (6.31) 

where Hr is the subtracted hard subgraph of region R. Since Hr is a 
smaller graph than F, it is well-behaved, by the hypothesis of the theorem. 
Moreover, because R is the closest singular surface to k, the propagators 
with lowest virtuality are in its jet (or soft) subgraphs, and so A(A;, R) ^ 
M^^t{k,H) < MB{k,H). Hence Thm. H gives 

(1 - VrT)Hr = o[l{Hr)^^^^^ , (6.32) 



and so 



VrTr)CrT = o{l{v) 



x\k,Ry 
MUk,r) 



o m 



(6.33) 



where the last statement follows since R is the closest singular surface to k, 
and the definition of Ms^k, F) is that it equals the distance to the closest 
singular surface. 

An example of this result is given by setting R = R2 for Fig. ^. In this 
case, Eq. ( |6.33| ) states that F — Fi — F2 = OlTM^^/Q"^) — as we saw at Eq. 



(b) Consider a singular region R' > R. We only need consider a leading region 
for R', since otherwise Cr/T = 0. The definition of Cr> gives 

Cr'T = Vr'Tr/C/j'F 

= {ietsofR')VR,TR,HR,. (6.34) 

Since R' > R the hard subgraph of R' is a bigger graph than the hard 
subgraph of R, and some of the lines of the virtuality that determines R as 
the closest singular surface to k must lie in this hard subgraph of R'. 
Moreover R' is a non-trivial singular surface, so its hard subgraph is smaller 
than F, so it is well-behaved, by the theorem's hypothesis. Since TriHri has 



its external lines on-shell, we can apply the definition of well-behavedness 
to give 



from which follows 



0(LiH„/Jim), ,6,35) 



C„,r = 0(L(r)Ml|l^). (6.36) 



An example is given by setting R = Ri and R' = R2 for Fig. ^. Then Eq. 
(|0^ ) states that T2 = 0[T{Mf^ + Ml^)/Q% 

(c) Suppose R' non-trivially intersects R. Just as in the previous case, we have 

Cn>r = ( jets of R' ) Vr>Tr^r,. (6.37) 

We are considering a singular surface that is closest to k. In this region a 
certain set of lines has low virtuality. Now in the factor TriHri, a certain 
other set of lines is set on-shell, and effectively this factor is being calculated 
in a neighborhood of the intersection of the two regions, R fl R' . This case 
reduces now to case (6). 

For Fig. ^, an example is given by setting R = R3 and R' = R2, so that 
Rr\R' = Ri, and Eq. ( |07|) becomes Fa = ©(FMfg/Q^)— gee Eq. 



(d) The final possibility is that R' r\ R = ^. The definition of R implies that k 
is close to R. Now, in CriT, the hard subgraph Hri has its external lines 
on shell, so k being near to R implies also that in the hard subgraph the 
momentum configuration is close to R' fl R. But there are no such points. 

We have shown that all the terms on the right- hand- side of Eq. ( |6.30|) are of 
order L(F)M^(A;, F)/(5^, and the theorem follows. 



Theorem 22 All graphs F are well-behaved. 

Proof: This is a simple consequence of mathematical induction applied to 
the previous theorem If there are graphs F that are not well-behaved, then 
there is a minimal such graph. Application of Thm. |21] shows that the graph is 
well-behaved, contrary to hypothesis. 



Theorem 23 The remainder is power- suppressed for an on-shell graph: 

RemT = o(^m^y 



(6 
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Proof: From the definitions, RemF = (1— T)r, and from tfie previous tlieorem, 
r is well-beliaved. Hence by Thm. BO 



Re.r^o(.(r,<f:I)).o(.,r)^). 



(6.39) 



It immediately follows that 
Theorem 24 AsyF is a good estimate ofT up to power-law corrections, i.e., 

T = XsyT + oUiT)'-^) 



Ec„r + o(L(r)^). 



(6.40) 



Theorem 25 On-shell subtracted hard subgraphs THji are IR safe. That is, in the integral 
over final-state momenta, no collinear singularities result when two or more of the external 
lines of a hard subgraph become collinear. 

Proof: This theorem is, of course, essential for effective perturbative calcula- 
tions. Because the masses of internal and external lines are set to zero by the 
T operation, the logarithmic enhancements in the collinear regions for the un- 
subtracted graphs become actual logarithmic divergences after the integral over 
final states. 

The IR safety follows from the theorem that any graph, e.g., Hji, is well- 
behaved: for then, in the neighborhood of any of the singularities, Hr has a 
power-law suppression compared with the unsubtracted graph. 



C. Factorization for the cross section 



We now have all the elements for use in the skeleton of the factorization proof in Sec. 
pi Q a definition of Cji{r) and a proof that the remainder in Eq. ( |3.3| ) is indeed power 
suppressed. The factorization theorem Eq. ( p.5| ) has therefore been proved. 

In Eq. ( [4.18|) , we expressed the factorization formula in terms of the phase-space integral 



for massless partons. Now we will reorganize this formula into a form that is particularly 
convenient for calculation, and especially for calculations by a MC event generator. It will 
consist of a hard-scattering cross section for making partons multiplied by the probabilities 
for each of the partons to hadronize to a particular state. The sum over the (conditional) 
hadronization probabilities is of course to be unity. 
To this end, we first define an integrated jet factor 

r dM"^ r 

Zifip/fin, m/fin) = j ^ j dL{pr, /,)e(/|//z^) J,(p,). (6.41) 

As before, Jj is the sum over graphs of the form of Fig. |T3|, and we integrate it over all 
possible final states with the cut-off function 9(Z|//i|,) = 6(M|//i|,) restricting the range of 
allowed jet masses. This integral is performed in the rest-frame of the final-state of the jet, 




i.e., we set Ij = {Mj, 0). The quantities fip and fiR are the factorization and renormahzation 
scales, and it is the cut-off function gives the factorization scale its meaning, since nothing 
else in Eq. ( |6.41| ) depends on ^p. 

Next we define a differential jet factor as 

dD{pf, If, ^^p) = dL{pf, m) e(M|//i|) f'^^;^ (6.42) 

This is the sum over fragmentation graphs completely differential in the phase space for the 
final-state particles. Note carefully that the phase-space measure dL contains the exact jet 
momentum Ij, not the approximated jet momentum Ij. To allow a probability interpretation, 
a normalization factor 1/Z is present so that the integral of the factor over final states and 
masses is unity: 

dM'^ r 

J dD{p,;lj) = l. (6.43) 

(This integral is to be performed in the jet's rest frame.) 
The hard scattering cross section is defined by 

dajv = K dL{l; g, 0) Hn Z{fxp)^. (6.44) 

This is the sum of fully subtracted hard scattering graphs times a factor Z{fip) for each 
final-state parton. We have included the factor K, and a phase-space measure so that da 
really does have the normalization of a parton-level differential cross section. Observe that 
Z plays the role of a wave-function renormahzation factor, just like the factor that would 
be obtained from the usual LSZ reduction method for a cross section. But note carefully 
that the correct procedure for calculating the hard-scattering cross section a is to use the 
integrated jet factor Z{fip) in place of the residue of the propagator pole that would be 
indicated by the LSZ theorem. 

With these definitions, the factorization formula Eq. ( [4.18| ) can be rewritten as 



a[W] = J2 d^Nii) X n / X n / dD{pj; Ij) x W{f) + non-leading power. (6.45) 



In the last line we have a hard-scattering cross section ctn and jet factors dD{pj). The 
integral over the jet momenta is expressed in terms of the massless momenta /, since we 
will calculate the hard-scattering factor from massless on-shell graphs (with the aid of some 
subtractions); this results in the disappearance of the Jacobian A compared with Eq. ( p. 51) . 

The power-law correction in Eq. ( |6.45| ) is of order the cross section times m'^/Q^, with 
the usual addition of a term associated with non-leading regions. 
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FIG. 14: Contour used to define the integrated jet factor Z{^f)- 



D. Infra-red finiteness of integrated jet factor Z{^p) 
The integrated jet factor Z{hf I jJ'R^'m/ hr) defined by Eq. ( |6.41|) is the integral of a cut 



propagator, Fig. [T3|. This impHes that, even though the integrand includes small jet masses, 
the integral is infra-red safe for large /Xi?, so that it is a good approximation to set particle 
masses to zero. The proof is the same as for an integral over the hadronic part of e^e~ cross 
section and for other similar quantities |]TU[. It goes as follows: The integrated jet factor is 
the integral over the discontinuity of the uncut propagator divided by 27r: 



Z{^ii 



Ci 2tc 



e{Myfxl)S{M^ 



(6.46) 



where the contour Ci encloses the cut of the propagator S'(M^), as in Fig. 14. The contour 
can be deformed into the complex plane to a region where the (complex) virtuality is of 
order fi'jp. Since the internal virtualities are now large, setting the particle mass m to zero 
is a good approximation that is valid up to a power-law correction. 

In the above derivation, we have assumed that the cut-off function is a theta function 
G(M^//i|.) = 6{fiF — M). If a smoother function is used, minor modifications in the above 
demonstration are needed. 



E. Renormalization group 



Let us use the term "factorization scale" to denote the scale fip that appears in the cut-off 
function 0(m^//i|,) in the above formulae, and let the usual renormalization scale be hr. 
The renormalization-group operator is, as usual. 



l^R 



d 



dfi 



R 



2 d nf \ 9 

dfiji dg 



1mm 



d 



dm? 



where 7^ is the anomalous dimension of the mass squared, and 

^9' 



m 

Then the integrated jet factor obeys 



5127r^ 



2 dZ{fiF/fiR]g{fiR)) 

f^R 



dfiR 



(6.47) 



(6.48) 



(6.49) 
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where the anomalous dimension of the parton field is 

li9) = ^,+0{g% (6.50) 

In QCD, the electromagnetic current is conserved and therefore has no anomalous di- 
mension. It follows that the hard scattering factor, with the appropriate factors of Z{^f) 
is renormalization-group- invariant. However, in our model field the photon-parton coupling 
does not involve a conserved current and it in fact has an anomalous dimension. Thus 
the renormalization group equation for the strong-interaction part of the hard scattering 
coefficient does have an anomalous dimension: 

I^R — 1 = f^R — 1 = -l3e[g)Hf^Z = -(3e{9)(^N/K, (6.51) 

where /3e is an anomalous dimension associated with the electromagnetic coupling: 

PM = (6.52) 



VII. MONTE-CARLO ALGORITHM 



A. Probabilities 



To obtain a Monte-Carlo algorithm, we observe that when the weight factor W{f) is 
omitted, the integrand of Eq. ( |6.45|) gives the probability density of the final states. The 
weight factor W simply gave a convenient way of unifying the derivations of cross sections 
into a derivation of a total weighted cross section. Therefore we can obtain any cross 
section by the following technique: (a) omit the weighting factor W{f), (b) multiply by 
the integrated luminosity, and (c) use a Monte-Carlo method to integrate and sum over the 
internal partonic variables {N, I, and Mj) and over the hadronic final states. Recording the 
events generated by the Monte-Carlo integration, together with their weights allows us to 
construct any binned cross section. 

Because of the factorization, the density for the final state is a product^° of 

• A density for particular partonic final states (variables N and /). This is given by the 
differential partonic cross section Eq. (|6.44| ). 



One annoying problem is that the cut-off on jet masses must be less than Q/N, for otherwise one can 
generate events that violate energy conservation. There are at least two solutions: (a) Set the cut-off 
to be smaller than Q/N^i^^, where A^max is the largest number of partons in the finite-order hard- 
scattering calculations that one actually uses. Then perform the somewhat complicated renormalization- 
group calculation that would enable the cut-off to be different for hard-scattering coefRcients with different 
numbers of final-state partons. (b) Permit a higher cut-off, but veto events whenever they are generated 
with Mj > Q. For consistency, a corresponding veto will need to be built into the subtraction terms 
in hard-scattering calculations. Note that the veto will only affect events in which a least one of the 
generated jets has an invariant mass of order Q. 
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• A (conditional) density for each jet mass Mj, given the partonic variables. 

• A (conditional) density for the final-state for each jet, given its mass Mj. 

Aside from the issue of negative weight terms, the densities are to be interpreted as proba- 
bility densities, that is they sum and integrate to unity. 

For determining the jet mass, it is convenient to define the integral over the jet factor Jj 
over its final states. With a normalization factor that will prove convenient, it is 

M2 f 

F{Mj; iiR,m{i^R), g{l^R)) = ^ J dL{Pj]lj)Jj{Pj]lj]lJ^R,m{iXR),g{jj.R)). (7.1) 

(The integral over the hadronic state is performed for a fixed value of the total jet momentum 
.) Apart from the normalization factor, this is just the cut propagator and it is simply 
related to the integrated jet factor: 



27r ^ dill 



(7.2) 



Hence the massless limit can be taken for large jet masses: F is infra-red safe. Notice 
that F depends on the renormalization scale, but not on the factorization scale. Since it is 
proportional to a cut propagator, it has anomalous dimension 7: 

1^1^ = -l{9{^^R)) F. (7.3) 
Then the distribution of jet masses is given by 

dMf Mf Z{ixf/ixr) 



(7.4) 



This is renormalization-group invariant, since it is the quotient of two quantities with the 
same anomalous dimension. 

If we set yUi? and ixr to be of order Q, then a pcrturbative calculation of the hard scattering 
is valid. However the calculation of probability density for Mj suffers from large logarithms 
whenever Mj <^ hr, so we must perform a renormalization group improvement for the 
calculation of the M? distribution. This enables F to be calculated with a renormalization 
scale Mj (a different scale for each jet): 



dP{M]-iiF) _ 1 F{M],Mj,g{Mj)) Q{M] / 14) _ \ ff^% dfj,'^ 



dMf Mf Z{fiF/f^R) 



exp 



/2 



(7.5) 



Observe that the renormalization scale fiR has been replaced by Mj in the numerator F but not 
in the denominator Z. Then a finite-order calculation of F is valid provided only that Mj 3> 
A, so that g{Mj) is small. Of course, if Mj is small, one must resort to phenomenological 
functions and models that are adjusted to fit data, just as in current event generators. I 
have set the renormalization scale to Mj for the jet, but any value of similar size would 
be equally good in principle. A somewhat smaller size would probably be best if the MS 
scheme is used, since the typical virtualities in the higher order calculation will be smaller 
than Mf. 
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This formula is very similar to the formulae used in standard MC event generators. For 
example, the integral over the anomalous dimension corresponds exactly to the Sudakov 
form factor in the standard LLA formulation 0, with the anomalous dimension being an 
integral of the DGLAP evolution kernel over the splitting variable z. To leading order, the 
XjZ factor can be replaced by its lowest order term, unity. Also, it can be verified that at 
leading (one-loop) order F is the anomalous dimension of the parton field: 



Thus the leading approximation to the distribution of iWj is 



(7.6) 



dP{M]-iip) 



2. 



l{g{M,)) e(M2//i|)exp 



A/2 /i'^ 



(l + 0(^(M,)2)). (7.7) 



However, if one goes beyond leading order some differences become apparent. The Z 
factor is no longer unity, and F is not the same as the anomalous dimension; both of these 
quantities are nevertheless perturbatively calculable (if hf and Mj are large). 

So far we have discussed the hard scattering probabilities and the calculation of the values 
of Mj for each jet. The final ingredient is the distribution of the final states of a particular 
jet given Mf 

dL{pf lj)JjiPj, Mf fXR ■=Mj) 

F(M,;/.,:=M,) " ^'''^ 

The denominator factor ensures that the integral over all final states is unity. In addi- 
tion, the numerator and denominator have the same anomalous dimension, so that dJ is 
renormalization-group invariant. We may therefore set the renormalization scale to Mj, as 
indicated. Since the denominator is an integrated quantity with external mass Mj, it can 
now be calculated perturbatively without large logarithms. The numerator has an external 
mass Mj with an equal renormalization scale. 

Because the numerator is exclusive, a fixed-order perturbative calculation suffers from 
large logarithms. However it is an object of the same type as the cross section that we 
originally tried to calculate: The problem of generating the final state from the fragmentation 
of the jth parton given its invariant mass Mj is essentially the same as the calculation of 
the final state of the whole event given Q. So we must recursively apply our algorithm to 
each jet. 

To this end we need a factorization formula for the jet, and the only difference between 
this formula and the corresponding formula for the cross section is that the hard-scattering 
cross sections dajy are replaced by evolution kernels dJjy: 



dJ = '^^ dj^ X Jet factors. 



(7.9) 



N 



where the jet factor is the same as in Eq. ( |6.45| ). Note the sum rule: 



1. 



(7.10) 
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B. Algorithm 

Since the number of variables to specify the final state is large, the use of Monte-Carlo 
techniques for the integration is appropriate. We have formulated the cross section as a prod- 
uct of factors, so that the following algorithm generates events with the correct probability 
distribution: 

1. Generate a partonic final state according to the distributions given by the hard- 
scattering cross sections da^ defined in Eq. ( |6.44| ). Let the state contain on-shell 
partons, and let their momenta be Ij (for j = 1, . . . A^). If the hard-scattering cross sec- 
tions are positive, then unweighted events can be generated, with the hard-scattering 
cross sections being treated as determining a probability distribution. If the hard- 
scattering cross sections are negative in some region, then appropriately weighted 
events must be generated. 

2. For each parton j 

(a) Choose a value Mj for the resulting jet, according to the distribution in Eq. ( [7.51) . 

(b) In the rest frame of the jet, construct the jet by a call to a showering algorithm. 
This starts at step 1, using the probability density dJjq in Eq. ( [7.9|) instead of 
the probability density that corresponds to daf^. 

(c) In its rest frame, let the total momentum of the jet be 1'^^ = {Mj, 0), and let the 
final-state hadron momenta be 

3. After all the jets have been constructed, apply a boost to each jet in the direction of 
its parent parton. Arrange the boost so that the total momentum of the jet in the 
overall center-of-mass frame obeys Eqs. ( [4.9| ) and ( [4.10| ). 

In this algorithm, the showering of each parton is performed in the rest frame of the parton, 
with appropriate variables being the angles of the resulting partons. In contrast, many 
current event generators ||^ use a momentum fraction variable, since the fragmentation is 
conceived of as being in a frame in which the partons are fast moving. The two views are 
necessarily equivalent. 

In order to have a solution of Eq. ( [4.10| ) for A, we must require that J2j Mj < Q. If we 
are to get results that are independent of the order in which the jets are fragmented, this 
implies that we most impose the condition Mj < Q/N for each jet separately. This implies 
a maximum to the possible values of np. Alternatively, as explained in footnote 0, we can 
apply a veto whenever J2j Mj > Q, and insert a corresponding correction in the subtraction 
terms in the hard-scattering coefficients. 

As is usual in perturbative calculations we have a choice of renormalization scale, to which 
is now added a choice of the cut-off function Q{M^/ fip). The choice should ideahy made 
with the aim of minimizing the impact of higher-order uncalculated perturbative corrections. 
The cross-section is invariant under changes of this choice if the cross section and showering 
are calculated exactly. 

But we have another choice in the algorithm. This is the correspondence between the 
approximated and unapproximated jet momenta at step ^. Any change in this correspon- 
dence amounts to a change of factorization scheme, and is compensated by corresponding 
changes in the subtraction terms that are used in the hard scattering cross section and in 
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FIG. 15: Diagrams up to one-loop order for jet factor. 



the splitting kernels. Such a choice always occurs in a Monte-Carlo event generator, but the 
necessity to construct a systematic algorithm for higher order corrections appears to me to 
restrict the range of simple choices. 

The definition I have used — Eqs. 
that: 

• Conservation of 4-momentum is obeyed at every stage. 

• The renormalization of momenta is symmetric in the jets, and in particular it should 
not depend on the order in which the jets are fragmented. 

• The fragmentation of a jet can be done independently of the momentum renormaliza- 
tion. This implies that the momentum renormalization amounts to a Lorentz trans- 
formation of each jet, but a different transformation for each jet. 

• We must also allow for the possibility, in a more complicated theory than 0^ , that there 
may be polarizations for the partons. In that case there are Lorentz spinors and vectors 
associated with the partons. To avoid complicating the Lorentz transformations that 
implement step |^ of the algorithm, we can require that the Lorentz transformations 
be simple boosts, for which the only natural direction is Ij. 



O)-(4.10) — seems to be the simplest if one requires 



VIII. EXAMPLES OF CALCULATIONS 

In this section I summarize the results of calculations that give the lowest-order quanti- 
ties and the simpler next-to-lowest-order quantities. I assume throughout that the cut-off 
function defining the factorization scale provides a sharp cutoff: 

e(MV/i|) =^(/iF-M). (8.1) 



A. Integrated jet factor 



Since the fully integrated jet factor is infra-red safe, it can be calculated in massless limit 
from the graphs of Fig. p5[ 
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Here I used dimensional regularization with a space-time dimension 6 — 2e. The last factor 
in the braces arises from the integral over the one-loop graph for real emission, and MS 
renormalization has been implemented by setting the bare coupling to be 

«2 = 9l(4?) (1+ pure-pole _e.„.s), (8.3) 

where the counterterms are poles at e = 0. 

We also need the jet factor F for non-zero jet mass. In the massless limit in the physical 
space-time dimension, it is 

As stated earlier, this equals the anomalous dimension 7 at the leading order. 



B. Distribution of partons in showering 



The square of the order g graph for parton splitting gives a uniform angular distribution 
in the rest frame of the parent parton. Thus the lowest order for the splitting probability is 

c^^2 = ^ + 0(/), (8.5) 

where dVL represents an element of solid angle in 5 space dimensions, and the denominator 
is the total 5-dimensional solid angle. 



C. NLO hard-scattering coefficient 

The two-parton hard-scattering cross section — see Eq. (|6.44|) — is obtained from the 
lowest-order graph of Fig. |l], from its one-loop virtual correction, and from multiplying 
the result by the square of the jet factor Z. The whole calculation is done in the massless 
limit and the result is 

da2 = KdL{ii, 12] q, 0) 

where K is the standard overall normalization factor for the cross section. 

The lowest-order amplitude for the three-parton cross section is given by the sum of 
the three graphs in Fig. |T6[ Since we are computing the hard-scattering coefficient, the 
external momenta are the approximated, massless parton momenta, /j. Before subtractions, 
the squared amplitudes give 



1 + 



647r3 



(8.6) 



(8.7) 
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FIG. 16: Amplitude for three-parton production. 



Here M^^ = {h + kf, etc., are the masses of pairs of partons. When integrated over the 
parton kinematics, this squared amphtude would give the well-known logarithmic collinear 
divergences. 

We have already calculated the subtractions for the square of one of the graphs in Sec. 
V B|. There are two other terms, obtained by permuting the labels for the final-state partons. 



It is convenient to use the conventional dimensionless variables for the parton energies in the 
overall center-of-mass frame: Xi = 2Ei/Q = 2l^/Q. These lie in the range < < 1, and 
they obey X1 + X2+X3 = 2. In terms of these variables the jet masses are = ~ 2:3), 
etc. Then the final result for the subtracted, massless 3-parton cross section is 



d&s = KdL{h,l2,h]q,0)-^ 



+ 



1 — Xi 1 — 2:2 1 — x^ 



Xi)2 xlil — X2)'^ x^l — Xa)^ 

(8.8) 

The factors of in the denominators of the subtraction terms result from the Jacobians in 
the transformation between three-parton and massless two-parton kinematics, as explained 
in Sec. 0. 



IX. CONCLUSIONS 

I have constructed a Monte-Carlo algorithm for e+e~ annihilation in a model theory that 
can be applied at arbitrarily non-leading order (and hence includes arbitrarily non-leading 
logarithms). The main features are: 

• Corrections to the hard-scattering cross section and to the kernel for parton showering 
are defined by a subtractive method to all orders of perturbation theory. 

• The subtractions are applied point-by-point in momentum space, so that well-behaved 
functions are obtained that can be used in the calculation of arbitrary observables, 
without any restriction to infra-red-safe quantities. 

• A modified LSZ prescription must be used for the normalization of the hard scatter- 
ing coefficient (and the corresponding parton showering kernels). The normal LSZ 
prescription requires the on-shell amputated amplitudes to be multiplied by an ap- 
propriate power of the residue of the external parton propagator. Instead the on-shell 
amputated and subtracted amplitudes are to be multiplied by an appropriate power 
of an integrated jet factor. 
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• Exact momentum conservation is enforced at each stage, with a definite prescription for 
relating exact parton momenta to the corresponding approximated massless momenta 
that are used in the calculation of the hard-scattering and splitting coefficients. 

• The produced jet masses are restricted by a cut-off function, and there is a 
renormalization-group-like invariance under changes of the cut-off function. 

• The showering algorithm is rather different from a conventional showering algorithm. 
It is applied in the rest frame of the generated off-shell parton. The interpretation 
purely in terms of DGLAP kernels does not appear to hold beyond the leading- 
logarithm level. 

Proofs were given that the errors are power suppressed. Thus the accuracy of the Monte- 
Carlo calculation is completely equivalent to that of conventional analytic calculations of 
inclusive processes, and so all the analytic (or matrix-element) results can be used, with a 
change of scheme, but now with the advantage of being able to treat arbitrary final-states. 
The method of proof appears to be capable of considerable generalization. 

The use of a subtractive method to construct the corrections means that the subtracted 
higher-order corrections are not necessarily positive. So the algorithm must generally be 
used to generate weighted events. However, as explained in Ref. 0, adjustment of the 
cut-off function can be used to reduce the number of negative weighted events. 

Of course, a model field theory is not QCD, and future work should aim at extending the 
results to QCD, as well as to lepto-production and hadro-production. The main problem 
here is well-known: that soft-gluon emission occurs at the leading power in any gauge theory, 
and the treatment of the leading regions needs to be considerable generalized. Unlike the 
case of inclusive cross sections, one cannot appeal directly to a cancellation of soft-gluon 
effects between real and virtual emission. One symptom of this problem is in the definition 
of the jet factor Z . In our model field theory it is given by a simple cut Green function of 
the parton field. But in a gauge theory, such a definition is not gauge invariant. 

Collins and Hautmann ^ have proposed some ideas at the one-loop level for how these 
subtractions should be performed in a way that allows gauge-invariant definitions of the jet 
factors and the similar initial-state parton densities. These definitions would represent a 



likely improvement on those by Collins and Soper ||T2 . 

Of course, coherent angular-ordered showering, such as is implemented in HERWIG, 
solves the problem of soft-gluon emission at the leading-logarithm level and somewhat be- 
yond, but it does not represent a complete solution to the problem of obtain non-leading 
corrections in general. 
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